COMPOSITIONS, ORGANISMS AND METHODOLOGIES 
EMPLOYING A NOVEL HUMAN PROTEIN PHOSPHATASE 



[0001] The present invention incorporates by reference U.S. Provisional Application 
Serial No. 60/420,757, filed October 24, 2003 and entitled "Compositions, Organisms and 
Methodologies Employing a Novel Human Phosphatase." 

FIELD OF THE INVENTION 

[0002] The present invention relates to compositions, organisms and methodologies 
employing a novel human protein phosphatase, calcineurin-like protein phosphatase 1 
(CLPP1), which has 91% sequence homology to the consensus sequence of the calcineurin- 
like phosphoesterase family. This invention can be used for diagnosing, prognosing, and 
treating phosphatase-related diseases and, in particular, diseases associated with aberrant 
expression of CLPP1. 

BACKGROUND OF THE INVENTION 

[0003] Protein phosphorylation/dephosphorylation plays a central role in the regulation 
of a variety of cell functions, such as cell proliferation, differentiation, and signaling 
processes. Uncontrolled signaling has been implicated in a variety of disease conditions 
including inflammation, cancer, arteriosclerosis, and psoriasis. It is estimated that more 
than 1 ,000 of the 1 0,000 proteins active in a typical mammalian cell are phosphorylated. As 
is well-known in the art, high energy phosphate, which drives activation, is generally 
transferred from adenosine triphosphate molecules (ATP) to a particular protein by protein 
kinases and removed from that protein by protein phosphatases. 

[0004] The presence or absence of a phosphate moiety modulates protein function in 
multiple ways. A common mechanism involves changes in the catalytic properties (Vmax 
and Km) of an enzyme, leading to its activation or inactivation. 

[0005] A second widely recognized mechanism involves promoting protein-protein 
interactions. An example of this is the tyrosine autophosphorylation of the ligand-activated 
EGF receptor tyrosine phosphatase. This event triggers the high-affinity binding to the 
phosphotyrosine residue on the receptor's C-terminal intracellular domain to the SH2 motif 
of an adaptor molecule Grb2. Grb2, in turn, binds through its SH3 motif to a second adaptor 
molecule, such as SHC. The formation of this ternary complex activates the signaling 



events that are responsible for the biological effects of EGF. Serine and threonine 
phosphorylation events also have been recently recognized to exert their biological fiinction 
through protein-protein interaction events that are mediated by the high-affinity binding of 
phosphoserine and phosphothreonine to WW motifs present in a large variety of proteins. 
[0006] A third important outcome of protein phosphorylation is changes in the 
subcellular localization of the substrate. As an example, nuclear import and export events 
in a large diversity of proteins are regulated by protein phosphorylation. 
[0007] Reversible protein phosphorylation is an essential regulatory mechanism in 
many cellular processes. While the post-translational modification alters the properties of 
key regulatory proteins involved in various biochemical pathways, protein kinases and 
phosphatases themselves are subject to control through the action of extracellular signals 
such as hormones and growth factors. Although much attention has been paid to the 
regulation of protein kinases, it is now apparent that protein phosphatases are also highly 
regulated enzymes that play an equally important role in the control of protein 
phosphorylation. 

[0008] Protein phosphatases may be roughly divided into three families based on their 
substrate: the serine/threonine (S/T) phosphatases, the tyrosine phosphatases, (PTP), and 
dual specificity phosphatases. 

[0009] Serine/threonine (S/T) protein phosphatases (PPases) catalyse the 
dephosphorylation of phosphoserine and phosphothreonine residues. Their action is 
opposed to that of a large number of serine/threonine protein kinases. In mammalian tissues 
four different types of PPase have been identified and are known as PP1, PP2A, PP2B and 
PP2C. Except for PP2C, these enzymes are evolutionary related. The catalytic regions of 
the proteins are well conserved and have a slow mutation rate, suggesting that major 
changes in these regions are highly detrimental. 

[0010] Protein tyrosine phosphatases (PTPs) catalyse the dephosphorylation of 
phosphotyrosine residues. PTPs represent a large family of enzymes that play a very 
important role in cellular signaling within and between cells. PTPs work antagonistically 
with protein tyrosine kinases (PTKs) to regulate signal transduction in a cell. 
[0011] A few protein phosphatases have dual specificity and dephosphorylate 
serine/threonine and tyrosine residues. This family now includes major regulators of 
growth cycle such as p80cdc25, as well as phosphatases that regulate the mitogen-activated 
protein kinase pathway. 
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SUMMARY OF THE INVENTION 

[0012] The present invention discloses compositions, organisms and methodologies 
employing a new human gene that encodes a protein sharing sequence homology with the 
calcineurin-like phosphoesterase family. The gene is localized in locus 10pl5 of human 
chromosome 10. This new gene is hereinafter referred to as human calcineurin-like protein 
phosphatase 1 (CLPP1) gene, and its encoded protein(s) is referred to as CLPP1 or CLPP1 
phosphatase. 

[0013] The amino acid residues 47-21 1 of CLPP1 show 91 .3% sequence alignment 
with the consensus sequences of the calcineurin-like phosphoesterase family. This family 
includes a diverse range of phosphoesterases, including protein phosphoserine 
phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2 -3' c AMP 
phosphodiesterases as well as nucleases such as bacterial SbcD or yeast MRE1 1. The utility 
of calcineurin-like phosphoesterase family members are known in the art. The unique 
peptide sequence of CLPP1, and nucleic acid sequences that encode the peptides, can be 
used as models for the development of human therapeutic targets, aid in the identification of 
therapeutic proteins, and serve as targets for the development of human therapeutic agents 
that modulate phosphatase activity in cells and tissues that express the phosphatase. 
[0014] In one aspect, the invention provides isolated polynucleotides comprising a 
nucleotide sequence encoding CLPP1 or a variant of CLPP1. 

[0015] In another aspect, the invention provides isolated polypeptides comprising the 
amino acid sequence of CLPP1 or a variant of CLPP1. 

[0016] In yet another aspect, the invention provides agents that modulate expression 
level of CLPP 1 gene or an activity of CLPP 1 . 

[0017] The invention also provides methods for (a) detecting polynucleotides 
comprising a nucleotide sequence encoding CLPP1 or a variant of CLPP1 and (b) detecting 
polypeptides comprising an amino acid sequence of CLPP1 or a variant of CLPP 1 in a 
biological sample. 

[0018] The invention further provides methods for screening agents that modulate 
expression level of CLPP 1 gene or an activity of CLPP1. 
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The invention further provides cell lines harboring the CLPP1 gene, animals transgenic for 
CLPP1 gene, and animals with interrupted CLPP1 gene (CLPP1 knockout animals). These 
cell lines and animals can be used to study the functions of CLPP1 . 
[0019] In still another aspect, the invention provides polynucleotides capable of 
inhibiting CLPP1 gene expression by RNA interference. 

[0020] The invention further provides methods of inhibiting CLPP 1 gene expression by 
introducing siRNA's or other RNAi sequences into target cells. 
[0021] The preferred embodiments of the inventions are described below in the 
Detailed Description of the Invention. Unless specifically noted, it is intended that the 
words and phrases in the specification and claims be given the ordinary and accustomed 
meaning to those of ordinary skill in the applicable art or arts. If any other meaning is 
intended, the specification will specifically state that a special meaning is being applied to a 
word or phrase. 

[0022] It is further intended that the inventions not be limited only to the specific 
structure, material or acts that are described in the preferred embodiments, but include any 
and all structures, materials or methods that perform the claimed function, along with any 
and all known or later-developed equivalent structures, materials or methods for performing 
the claimed function. 

[0023] Further examples exist throughout the disclosure, and it is not applicant's 
intention to exclude from the scope of his invention the use of structures, materials, or 
methods that are not expressly identified in the specification, but nonetheless are capable of 
performing a claimed function. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0024] The inventions of this application are better understood in conjunction with the 
following drawings, in which: 

[0025] FIGURE 1 compares amino acid residues 47 to 2 1 1 of CLPP 1 to the consensus 

sequences of the calcineurin-like phosphoesterase family. 

[0026] FIGURE 2 shows the hydrophobicity profile of CLPP 1 . 

DETAILED DESCRIPTION OF THE INVENTION 

[0027] The following detailed description is presented to enable any person skilled in 
the art to make and use the invention. For purposes of explanation, specific nomenclature is 
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set forth to provide a thorough understanding of the present invention. However, it will be 
apparent to one skilled in the art that these specific details are not required to practice the 
invention. Descriptions of specific applications are provided only as representative 
examples. Various modifications to the preferred embodiments will be readily apparent to 
one skilled in the art, and the general principles defined herein may be applied to other 
embodiments and applications without departing from the scope of the invention. The 
present invention is not intended to be limited to the embodiments shown, but is to be 
accorded the widest possible scope consistent with the principles and features disclosed 
herein. 

[0028] The present invention is based on the sequence information obtained from a 
newly-developed genomic prediction pipeline. Briefly, the X-ray crystal structures of the 
catalytic domains of protein phosphatases were collected and aligned together according to 
their structural identity/similarities. The alignment was converted into a "scoring matrix" 
which carried the structural profile of the phosphatase catalytic domains. This scoring 
matrix was then used to search the Celera Human Genome database and pull out sequences 
that have phosphatase catalytic domains. 

[0029] Based on this analysis, the present invention provides the amino acid sequence 
of a human peptide containing a phosphatase domain that is highly homologous to the 
consensus sequences of the calcineurin-like phosphoserine family, cDNA sequences and 
genomic sequences that encode the phosphatase peptide, and information about the closest 
art known protein/peptide/domain that has structural or sequence homology to the 
phosphatase of the present invention. 

[0030] The peptide of the present invention may be used for the development of 
commercially important products and services. Various aspects of the invention are 
described in detail in the following subsections. It should of course be understood that the 
use of subsections is not meant to limit the invention. Rather, each subsection applies to 
any aspect of the invention, as is appropriate. 

De finitions and Terms 

[0031] To facilitate the understanding of the present invention, a number of terms and 
phrases are defined below: 

[0032] As used herein, a polynucleotide or a polypeptide is "isolated" if it is removed 
from its native environment. For instance, a polynucleotide or a polypeptide is isolated 
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through a purification process such that the polynucleotide or polypeptide is substantially 
free of cellular material or free of chemical precursors. The polynucleotide/polypeptide of 
the present invention can be purified to homogeneity or other degrees of purity. The level 
of purification will be based on the intended use. As appreciated by one of ordinary skill in 
the art, a polynucleotide/polypeptide can perform its desired function(s) even in the 
presence of considerable amounts of other components or molecules. 
[0033] In some uses, a polynucleotide/polypeptide that is "substantially free of cellular 
material" includes preparations which have less than about 30% (by weight) other 
polynucleotides/polypeptides including contaminating polynucleotides/polypeptides. For 
instance, the preparations can have less than about 20%, less than about 10%, or less than 
about 5% other polynucleotides/polypeptides. If a polynucleotide/polypeptide preparation 
is recombinantly produced, it can be substantially free of culture medium, i.e., culture 
medium components representing less than about 20% by weight of the 
polynucleotide/polypeptide preparation. 

[0034] The language "substantially free of chemical precursors" includes preparations 
in which the polynucleotide/polypeptide is separated from chemical precursors or other 
chemicals that are involved in the synthesis of the polynucleotide/polypeptide. In one 
embodiment, the language "substantially free of chemical precursors" includes kinase 
preparations having less than about 30% (by weight), less than about 20% (by weight), less 
than about 10% (by weight), or less than about 5% (by weight) chemical precursors or other 
chemicals used in the synthesis. 

[0035] A "polynucleotide" can include any number of nucleotides. For instance, a 
polynucleotide can have at least 10, 20, 25, 30, 40, 50, 100 or more nucleotides. A 
polynucleotide can be DNA or RNA, double-stranded or single-stranded. A polynucleotide 
encodes a polypeptide if the polypeptide is capable of being transcribed and/or translated 
from the polynucleotide. Transcriptional and/or translational regulatory sequences, such as 
promoter and/or enhancer(s), can be added to the polynucleotide before said transcription 
and/or translation occurs. Moreover, if the polynucleotide is singled-stranded, the 
corresponding double-stranded DNA containing the original polynucleotide and its 
complementary sequence can be prepared before said transcription and/or translation. 
[0036] As used herein, "a variant of a polynucleotide" refers to a polynucleotide that 
differs from the original polynucleotide by one or more substitutions, additions, and/or 
deletions. For instance, a variant of a polynucleotide can have 1, 2, 5, 10, 15, 20, 25 or 
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more nucleotide substitutions, additions or deletions. Preferably, the modification(s) is in- 
frame, i.e., the modified polynucleotide can be transcribed and translated to the original or 
intended stop codon. If the original polynucleotide encodes a polypeptide with a biological 
activity, the polypeptide encoded by a variant of the original polynucleotide variants 
substantially retains such activity. Preferably, the biological activity is reduced/enhanced 
by less than 50%, or more preferably, less than 20%, relative to the original activity. 
[0037] A variant of a polynucleotide can be a polynucleotide that is capable of 
hybridizing to the original polynucleotide, or the complementary sequence thereof, under 
reduced stringent conditions, preferably stringent conditions, or more preferably, highly 
stringent conditions. Examples of conditions of different stringency are listed in Table 1. 
Highly stringent conditions are those that are at least as stringent as conditions A-F; 
stringent conditions are at least as stringent as conditions G-L; and reduced stringency 
conditions are at least as stringent as conditions M-R. As used in Table 1 , hybridization is 
carried out under a given hybridization condition for about 2 hours, followed by two 15- 
minute washes under the corresponding washing condition(s). 
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Table 1. Stringency Conditions 



Stringency 
Condition 


Poly- 
nucleotide 
Hybrid 


Hybrid 
_,engtn \py>) 


Hybridization 

1 CllipcIalUlC allKX Liuiivi 


Wash Temp, 
and Buffer 


A 


DNA:DNA 


>50 


65°C; lxSSC -or- 
42°C; lxSSC, 50% formamide 


65°C; 0.3xSSC 


B 


DNA:DNA 


<50 


T B *; lxSSC 


T B *; lxSSC 


C 


DNA:RNA 


>50 


67°C; lxSSC -or- 
45°C; lxSSC, 50% formamide 


67°C; OJxSSC 


D 


DNA:RNA 


<50 


T D *; lxSSC 


T D *; lxSSC 


E 


RNA:RNA 


>50 


70°C; lxSSC -or- 
50°C; lxSSC, 50% formamide 


70°C; 0.3xSSC 


F 


RNA:RNA 


<50 


T F *; lxSSC 


T f *; lxSSC 


G 


DNA:DNA 


>50 


65°C; 4xSSC -or- 
42°C; 4xSSC, 50% formamide 


65°C; lxSSC 


H 


DNA:DNA 


<50 


T H *; 4xSSC 


T H *; 4xSSC 


I 


DNA:RNA 


>50 


67°C; 4xSSC -or- 
45°C; 4xSSC. 50% formamide 


67°C; lxSSC 


J 


DNA:RNA 


<50 


Tj*; 4xSSC 


Tj*;4xSSC 


K 


RNA:RNA 


>50 


70°C; 4xSSC -or- 
50°C; 4xSSC, 50% formamide 


67°C; lxSSC 


L 


RNA:RNA 


<50 


T L *; 2xSSC 


T L *; 2xSSC 


M 


DNA:DNA 


>50 


50°C; 4xSSC -or- 
40°C; 6xSSC, 50% formamide 


50°C; 2xSSC 


N 


DNA:DNA 


<50 


T N *; 6xSSC 


T N *; 6xSSC 


O 


DNA:RNA 


>50 


55°C; 4xSSC -or- 
42°C; 6xSSC, 50% formamide 


55°C; 2xSSC 


P 


DNA:RNA 


<50 


T P *; 6xSSC 


T P *; 6xSSC 


Q 


RNA-.RNA 


>50 


60°C; 4xSSC -or- 
45°C; 6xSSC, 50% formamide 


60°C; 2xSSC 


R 


RNA:RNA 


<50 


T R *; 4xSSC 


T R *; 4xSSC 



The hybrid length is that anticipated for the hybridized region(s) of the hybridizing 
polynucleotides. When hybridizing a polynucleotide to a target polynucleotide of unknown 
sequence, the hybrid length is assumed to be that of the hybridizing polynucleotide. When 
polynucleotides of known sequence are hybridized, the hybrid length can be determined by 
aligning the sequences of the polynucleotides and identifying the region or regions of 
optimal sequence complementarity. 

H : SSPE (lxSSPE is 0.15M NaCl, lOmM NaH 2 P0 4 , and 1.25mM EDTA, pH 7.4) can be 
substituted for SSC (lxSSC is 0.1 5M NaCl and 15mM sodium citrate) in the hybridization 
and wash buffers. 

T B * - T R *: The hybridization temperature for hybrids anticipated to be less than 50 base 
pairs in length should be 5-10°C less than the melting temperature (T m ) of the hybrid, where 
T m is determined according to the following equations. For hybrids less than 18 base pairs 
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in length T m (°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 18 and 
49 base pairs in length, T m (°C) = 81.5 + 16.6(log 10 Na + ) + 0.41(%G + C) - (600/N), where N 
is the number of bases in the hybrid, and Na + is the molar concentration of sodium ions m 
the hybridization buffer (Na + for lxSSC = 0.1 65M). 

[0038] It will be appreciated by those of ordinary skill in the art that, as a result of the 
degeneracy of the genetic code, there are many polynucleotide variants that encode the 
same polypeptide. Some of these polynucleotide variants bear minimal sequence homology 
to the original polynucleotide. Nonetheless, polynucleotides that vary due to differences in 
codon usage are specifically contemplated by the present invention. 

[0039] As used herein, a "polypeptide" can include any number of amino acid residues. 
For instance, a polypeptide can have at least 5, 10, 15, 20, 30, 40, 50 or more amino acid 
residues. 

[0040] As used herein, a "variant of a polypeptide" is a polypeptide that differs from 
the original polypeptide by one or more substitutions, deletions, and/or insertions. 
Preferably, these modifications do not substantially change (e.g. reduce or enhance) the 
original biological function of the polypeptide. For instance, a variant can reduce or 
enhance or maintain the biological activities of the original polypeptide. Preferably, the 
biological activities of the variant is reduced or enhanced by less than 50%, or more 
preferably, less than 20%, relative to the original polypeptide. 

[0041] Similarly, the ability of a variant to react with antigen-specific antisera can be 
enhanced or reduced by less than 50%, preferably less than 20%, relative to the original 

— N 

polypeptide. These variants can be prepared and evaluated by modifying the original 
polypeptide sequence and then determining the reactivity of the modified polypeptide with 
the antigen-specific antibodies or antisera. 

[0042] Preferably, a variant polypeptide contains one or more conservative 
substitutions. A "conservative substitution" is one in which an amino acid is substituted for 
another amino acid which has similar properties, such that one skilled in the art would 
expect that the secondary structure and hydropathic nature of the substituted polypeptide 
will not be substantially changed. Conservative amino acid substitutions can be made on 
the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or 
the amphipathic nature of the residues. Negatively charged amino acids include aspartic 
acid and glutamic acid, and positively charged amino acids include lysine and arginine. 
Amino acids having uncharged polar head groups and similar hydrophilicity values include 
leucine, isoleucine and valine, or glycine and alanine, or asparagine and glutamine, or 
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serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that can produce 
conservative changes include: (1) ala, pro, gly, glu, asp, gin, asn, ser, thr; (2) cys, ser, tyr, 
thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his. A polypeptide 
variant can also contain nonconservative changes. 

[0043] Polypeptide variants can be prepared by the deletion and/or addition of amino 
acids that have minimal influence on the biological activity, immunogenicity, secondary 
structure and/or hydropathic nature of the polypeptide. Variants can be prepared, for 
instance by substituting, modifying, deleting or adding one or more amino acids residues in 
the original sequence. Polypeptide variants preferably exhibit at least about 70%, more 
preferably at least about 90%, and most preferably at least about 95% sequence homology 
to the original polypeptide. 

[0044] Polypeptide variants include polypeptides that are modified from the original 
polypeptides either by a natural process, such as a post-translational modification, or by a 
chemical modification. These modifications are well known in the art. Modifications can 
occur anywhere in the polypeptide, including the backbone, the amino acid side-chains and 
the amino or carboxyl termini. It will be appreciated that the same type of modification can 
be present in the same or varying degrees at several sites in a given polypeptide. Also, a 
given polypeptide can contain many types of modifications. Polypeptides may be branched, 
for example, as a result of ubiquitination, and they may be cyclic, with or without 
branching. Cyclic, branched, and branched cyclic polypeptides can result from natural post- 
translational processes or be made through synthetic methods. Suitable modifications for 
this invention include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphatidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine, 
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor 
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, 
transfer-RNA mediated addition of amino acids to proteins such as arginylation, and 
ubiquitination. 

[0045] As used herein, the term "modulation" includes up-regulation, induction, 
stimulation, potentiation, inhibition, down-regulation or suppression, or relief of inhibition. 
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[0046] A nucleotide sequence is "operably linked" to another nucleotide sequence if 
the two sequences are placed into a functional relationship. For example, a coding sequence 
is operably linked to a 5' regulatory sequence if the 5' regulatory sequence can initiate 
transcription of the coding sequence in an in vitro transcription/translation system or in a 
host cell. "Operably linked" does not require that the DNA sequences being linked are 
contiguous to each other. Intervening sequences may exist between two operably linked 
sequences. 

[0047] As used herein, a "disease-free" human refers to a human who does not have 
CLPP1 -related diseases. Disease-free cells, tissues or samples refer to cells, tissues or 
samples obtained from disease-free human(s). 

[0048] A polynucleotide is "capable of hybridizing" to a gene if the polynucleotide can 
hybridize to at least one of the following sequences: (1) the sequence of a RNA transcript of 
the gene, (2) the complementary sequence of a RNA transcript of the gene, (3) the cDNA 
sequence of a RNA transcript of the gene, (4) the complementary sequence of the cDNA 
sequence of a RNA transcript of the gene, (5) a genomic sequence of the gene, and (6) the 
complementary sequence of a genomic sequence of the gene. 

[0049] As used herein, sequence identity or percentage alignment in an alignment can 
be determined by the standard protein-protein or nucleotide-nucleotide BLAST programs 
{i.e., blastp or blastn, respectively). Sequence identity or percentage alignment can also be 
determined by the BLAST2 program. Suitable BLAST programs can be found at the 
BLAST web site maintained by the National Center of Biotechnology Information (NCBI) 
(National Library of Medicine, USA). 

Human CLPP1 Gene and CLPP1 Phosphatase 

[0050] The present invention identifies a new human gene (CLPP1 gene) that encodes 
a protein containing sequences highly homologous to the consensus sequence of the 
calcineurin-like phosphoesterase family. The nucleotide sequence encoding CLPP1 and the 
amino acid sequence of CLPP1 are depicted in SEQ ID NOS:l and 2, respectively. The 
CLPP1 gene is localized in locus 10pl5 of human chromosome 10. Specifically, the 
CLPP1 gene is located between genes LOCI 19379 and LOC159671, and overlaps with 
gene LOCI 59669. 

[0051] Human chromosome loci 1 Op 1 5 and the neighboring regions have been 
associated with multiple diseases, including but not limited to, prostate cancer, glioma, 
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melanoma, primary open-angle glaucoma, primary endometrial carcinoma, multiple 
sclerosis, schizophrenia, and partial DiGeorge syndrome. It has been suggested that the 
distal end of the short arm of chromosome 10 harbors at least one tumor suppressor gene. 
[0052] Human CLPP1 gene has 4 exons. The exons are mapped to the nucleotide 
sequences of human chromosome 10 in Celera genomic database (SEQ ID NO:3). The 
exons are also mapped to nucleotides 4200965 to 4203759 of human chromosome 10 in the 
Entrez Human Genome Sequence Database maintained by NCBI. Table 2 lists the location 
of each of these 4 exons in the genomic sequence SEQ ID NO:3. Table 2 also illustrates the 
corresponding location of each exon in the CLPP1 -coding sequence SEQ ID NO:l. 



Table 2. Exons in Human CLPP1 Gene 



Exon Numbers 


Corresponding Sequence in SEQ 
ID NO:3 


Corresponding Sequence in 
SEQ ID NO: 1 


1 


1-55 


1-55 


2 


789 - 898 


56- 165 


3 


1589-2131 


166-708 


4 


2178-2795 


709- 1326 



[0053] A conserved domain search using RPS-BLAST program (RPS-BLAST 2.2.3 
[April 24, 2002], available at the BLAST web site maintained by NCBI), showed that 
CLPP1 contained sequences homologous to a consensus sequence of a family of protein 
phosphatases. 

[0054] Specifically, the amino acid residues 47 to 21 1 of CLPP1 are highly 
homologous to the consensus sequence of the calcineurin-like phosphoesterase family 
(pfam00149). This family includes a diverse range of phosphoesterases, including protein 
phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' 
cAMP phosphodiesterases as well as nucleases such as bacterial SbcD or yeast MRE11. 
The most conserved regions in this family center around the metal chelating residues. FIG. 
1 shows that the two sequences share 91.3% alignment with a score of 51.5 bits and an E 
value of 6 x 10" 8 . The "Query" denotes the sequence of CLPP1, and "Sbjcf refers to the 
sequence being compared to the CLPP1 sequence. 
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[0055] In addition, CLPP1 also shares partial sequence homologies to a number of 
ser/thr phosphatases, including but are not limited to, Bacteriophage lambda ser/thr protein 
phosphatase (Entrez accession number: P03772, 75% alignment to amino acid residues 55 - 
242 of CLPP1), E. coli ser/thr protein phosphatase ( Entrez accession number: NC002695, 
74% alignment to amino acid residues 55 - 242 of CLPP1), E. coli phosphoprotein 
phosphatase 1 (Entrez accession number: AP002559, 47% alignment to amino acid residues 
60 - 239 of CLPP1), and S. typhimurium protein phosphatase B (Entrez accession number: 
AY049951, 42% alignment to amino acid residues 43 - 239 of CLPP1). 
[0056] FIGURE 2 shows the hydrophobicity profile of CLPP1 . The hydrophobicity 
analysis indicates that CLPP1 phosphatase is not likely a membrane or transmembrane 
protein. 

[0057] BLAST analysis showed that CLPP1 is identical in both amino acid and 
nucleotide sequences to a hypothetical human protein XP_1 00270 (Entrez accession 
number: XM100270), which was predicted from NCBI contig NT0241 15 by automated 
computational analysis using gene prediction method GenomeScan. 
[0058] The existence and expression of CLPP 1 gene in humans are supported by 
various EST sequences. For instance, nucleotides 1 - 57 of SEQ ID NO:l are supported by 
the EST sequences disclosed under NCBI EST database accession number BI829819; 
nucleotides 172-815 of SEQ ID NO:l are supported by the EST sequence disclosed under 
NCBI DNA patent database accession numbers AAS65776, AAS65777, and AAS82423; 
nucleotides 763 - 1308 of SEQ ID NO: 1 are supported by the EST sequences disclosed 
under ENSEMBL database accession number ENST00000266954. 

Utility ofCLPPl eene and CLPP1 phosphatase 

[0059] As the counterpart of protein kinases, protein phosphatases are involved in the 
regulation of many critical biological processes. The protein 

phosphorylation/dephosphorylation cycle is one of the major regulatory mechanisms 
employed by eukaryotic cells to control cellular activities including growth and 
differentiation, cell-to-cell contacts, the cell cycle, and oncogenesis. For example, cellular 
signal transduction is a fundamental mechanism whereby external stimuli that regulate 
diverse cellular processes are relayed to the interior of cells. The biochemical pathways 
through which signals are transmitted within cells comprise a circuitry of directly or 
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functionally connected interactive proteins. One of the key biochemical mechanisms of 
signal transduction involves the reversible phosphorylation of certain residues on proteins. 
[0060] Based on their substrate specificity, protein phosphatases may be roughly 
divided into three families: serine/threonine (S/T) phosphatases, tyrosine phosphatases, 
(PTPs), and dual specificity phosphatases (DSPs). 

[0061] Serine/threonine phosphatases are either cytosolic or associated with a receptor. 
On the basis of their sensitivity to two thermostable proteins, inhibitors 1 and 2, and their 
divalent cation requirements, the serine/threonine phosphatases can be separated into four 
distinct groups, PP1, PP2A, PP2B, and PP2C. 

[0062] PP1 is a major eukaryotic protein serine/threonine phosphatase that regulates an 
enormous variety of cellular functions through the interaction of its catalytic subunit (PPlc) 
with over fifty different established or putative regulatory subunits. PP1 dephosphorylates 
many of the proteins phosphorylated by cyclic AMP-dependent protein phosphatase and is 
therefore an important regulator of many cyclic AMP mediated, hormone responses in cells. 
Most of these PPlc targets interact with a small hydrophobic groove on the surface of PPlc 
through a short conserved binding motif, the RVxF motif, which is often preceded by 
further basic residues. Weaker interactions may subsequently enhance binding and 
modulate PP1 activity/specificity in a variety of ways. Several putative targeting subunits 
do not possess an RVxF motif but nevertheless interact with the same region of PPlc. In 
addition, several 'modulator' proteins bind to PPlc but do not possess a domain targeting 
them to a specific location. Most are potent inhibitors of PPlc and possess at least two sites 
for interaction with PPlc, one of which is identical or similar to the RVxF motif. 
Regulation of PPlc in response to extracellular and intracellular signals occurs mostly 
through changes in the levels, conformation or phosphorylation status of targeting subunits. 
Understanding of the mode of action of PPlc complexes may facilitate development of 
drugs that target particular PPlc complexes and thereby modulate the phosphorylation state 
of a very limited subset of proteins. 

[0063] PP2A is the main phosphatase responsible for reversing the phosphorylations of 
serine/threonine phosphatases. All forms of PP2A contain a catalytic subunit (PP2Ac) 
which forms a stable complex with the structural subunit PR65/A. The heterodimer PP2Ac- 
PR65/A associates with regulatory proteins, termed variable subunits, in order to form 
trimeric holoenzymes attributed with distinct substrate specificity and targeted to different 
subcellular compartments. PP2Ac activity can be modulated by reversible phosphorylation 
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on Tyr307 and methylation on C-terminal Leu309. Studies on PP2A have shown that this 
enzyme may be implicated in the regulation of metabolism, transcription, RNA splicing, 
translation, differentiation, cell cycle, oncogenic transformation and signal transduction. 
[0064] PP2B, or calcineurin (Cn), is a Ca 2+ -activated phosphatase; it is involved in the 
regulation of such diverse cellular functions as ion channel regulation, neuronal 
transmission, gene transcription, muscle glycogen metabolism, and lymphocyte activation. 
[0065] PP2C is a Mg 2+ -dependent phosphatase which participates in a wide variety of 
functions including regulating cyclic AMP-activated protein-phosphatase activity, Ca 2+ - 
dependent signal transduction, tRNA splicing, and signal transmission related to heat shock 
responses. PP2C is a monomelic protein with a molecular mass of about 40-45 kd. One 
alpha and several beta isoforms of PP2C have been identified. 
[0066] Protein tyrosine phosphatases (PTPs) remove a phosphate moiety from 
phosphotyrosine. Their substrates include elements of signal transduction pathway such as 
receptors, transcription factors or ion channels, as well as structural proteins such as 
filaments and cellular motors. PTPs regulate cell differentiation and are also involved in 
malignant transformation. PTP1B, a major phosphatase of breast cancer cell lines, activates 
an oncogene, c-src. Regulatory networks of PTPs can be extensive and redundant, with one 
enzyme being able to complement others. The sequence provided by the present invention 
can be used to isolate a possible group of interrelated PTPs. Specific inhibitors of protein 
phosphatases are being developed. Especially promising are the drugs that target PTPs in 
hematopoietic and lymphoid tissues; such drugs can be used to treat various malignant 
conditions. 

[0067] Dual-specificity phosphatases (DSPs) are important for regulating mitogenic 
signal transduction processes that occur, for example, in the cell cycle and in extracellular 
stimulation pathways. DSPs may be involved in regulating meiosis and may also play an 
important role in regulating the differentiation of testicular germ cells during 
spermatogenesis. 

[0068] The levels of protein phosphorylation required for normal cell growth and 
differentiation at any time are achieved through the coordinated action of protein kinases 
and phosphatases. Depending on the cellular context, these two types of enzymes may either 
antagonize or cooperate with each other during signal transduction. An imbalance between 
these enzymes may impair normal cell functions leading to metabolic disorders and cellular 
transformation. 
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[0069] Protein kinases have been a major target for drug action and development. A 
January 2002 survey of ongoing clinical trials in the USA revealed more than 100 clinical 
trials involving the modulation of kinases. Trials are ongoing in a wide variety of 
therapeutic indications including asthma, Parkinson's, inflammation, psoriasis, rheumatoid 
arthritis, spinal cord injuries, muscle conditions, osteoporosis, graft versus host disease, 
cardiovascular disorders, autoimmune disorders, retinal detachment, stroke, epilepsy, 
ischemia/reperfusion, breast cancer, ovarian cancer, glioblastoma, non-Hodgkin's 
lymphoma, colorectal cancer, non-small cell lung cancer, brain cancer, Kaposi's sarcoma, 
pancreatic cancer, liver cancer, and other tumors. Protein phosphatases are among the many 
modulators of kinase activity. 

[0070] For example, the overexpression of PTKs, such as HER2, can play a decisive 
role in the development of cancer and that antibodies capable of blocking the activity of this 
enzyme can abrogate tumor growth. Blocking the signal transduction capability of tyrosine 
phosphatases such as Flk-1 and the PDGF receptor have been shown to block tumor growth 
in animal models. 

[0071] The importance of phosphatases in the etiology of diseases has been well 
established. Malfunction of phosphatase activity has been associated with a number of 
human disorders. For example, the gene for human RPTPy has been localized to 
chromosome 3p21 which is a segment frequently altered in renal and small lung carcinoma 
(Wary et al., Cancer Res 52:478-482, 1993). Loss of phosphatase activity in myotubularin- 
related protein 2 is associated with Charcot-Marie-Tooth disease type 4B1 (Berger P. et al., 
Hum Mol Genet 11:1 569-79, 2002). There is also evidence linking the low-molecular- 
weight protein tyrosine phosphatase (LMPTP) to several common diseases, including 
allergy, asthma, obesity, myocardial hypertrophy, and Alzheimer's disease (Bottini N et al., 
Arch Immunol Ther Exp (Warsz) 50:95-104, 2002) 

[0072] Regulation of signal transduction by cytokines and association of signal 
molecules with protooncogenes and tumor suppressor genes have been the subjects of 
intense research. Recent genetic and biochemical studies indicate that protein phosphatases, 
which play an important role in signal transduction process, represent a novel platform for 
drug discovery. Detailed knowledge of protein phosphatase substrate specificity and the 
wealth of structure data on protein phosphatases provide a solid foundation for rational 
phosphatase inhibitor design. Many therapeutic strategies can now be developed through 
the synthesis of compounds which activate or inactivate protein phosphatases. 
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[0073] The ability of phosphatase inhibitors to interfere with aberrant cell activity has 
been demonstrated. For example, the naturally occurring serine/threonine phosphatase 
inhibitor okadaic acid has been shown to induce apoptosis in myeloid leukemia cells (Ishida 
Y. et al., J. Cell. Physiol. 150:484, 1992) and in rat hepatocytes, rat pituitary adenoma cell, 
human mammary carcinoma cells and human neuroblastoma cells (Boe R. et al., Exp. Cell 
Res. 195:237, 1991). It is also well known that the immunosuppressive activity of 
cyclosporin A, a commonly used immunesuppressor in organ transplantation, is mediated 
by inhibiting calcineurin phosphatase. Recently, cyclosporin A was found to enhance taxol- 
induced apoptosis of human urinary bladder cancer cells (Nomura T. et al., Urol Res 
30:102-1 1 , 2002). The study suggests that the effect of cyclosporin A is at least partly due 
to the inhibition of calcineurin activity and the loss of the antiapoptotic function of Bcl-2 via 
the enhancement of phosphorylation and the reduction of expression. 
[0074] In summary, phosphatase proteins are a major target for drug action and 
development. Accordingly, it is valuable to the field of pharmaceutical development to 
identify and characterize previously unknown members of phosphatase proteins. The 
present invention advances the state of the art by providing a novel human phosphatase 
protein that has sequence similarities to calcineurin-like phosphatases family. Specifically, 
the phosphatase domain in CLPP1 shares a 91.3% sequence identity with the consensus 
sequences of the calcineurin-like phosphatases. This domain, either in its native form or in 
a mutant form, can be used to affect the function of the corresponding domain in other 
phosphatases. The phosphatase domain in CLPP1 can be used to dephosphorylate suitable 
substrates. 

[0075] The CLPP1 gene and the gene products can be used as molecular markers for 
diagnosing, prognosing, and monitoring the treatment of disorders related to the aberrant 
expression of CLPP1 . The gene provided by the present invention can be expressed in yeast 
to identify possible interactors and substrates; this can be done by means of a 
complementation assay or a two-hybrid experiment. In addition, the CLPP1 gene can be 
used to screen for potential agents or drugs capable of enhancing or inhibiting the CLPP1 
gene expression in human cells. The CLPP1 gene products (polynucleotide and 
polypeptide) can be used to screen for potential agents or drugs capable of enhancing or 
inhibiting CLPP1 activity. Artificially synthesized enzymes as well as derived peptides can 
be used to activate or inhibit cellular processes modulated by CLPP1 . Immunoassay or 
PCR may be used to measure the concentration of CLPP1 and detect abnormally developing 
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tissue or cancerous growth. Furthermore, various therapeutic methods for treating disorders 
related to the aberrant expression of CLPPl can be designed based on the CLPPl gene, its 
variants, or the agents/drugs that affect the expression of the CLPPl gene or the activity of 
the CLPPl gene products. 

[0076] The following subsections illustrate examples of the utilities of human CLPPl 
gene and CLPPl phosphatase. Various changes and modifications within the scope of the 
present invention will become apparent to those skilled in the art from the present 
description. 

Polynucleotides and V ariants Thereof 

[0077] One aspect of the invention pertains to isolated polynucleotide probes capable 
of hybridizing to the CLPPl gene or its transcripts, such as CLPPl mRNAs. These probes 
can be used to detect the expression level of the CLPPl gene in human tissue or cells. The 
present invention also contemplates polynucleotide fragments for use as PCR primers for 
the amplification or mutation of the CLPPl gene or the CLPPl-coding sequences. Another 
aspect of the invention pertains to isolated polynucleotides that encode CLPPl, or a 
fragment or mutant thereof. These polynucleotides can be used for expressing CLPPl , or a 
fragment or mutant thereof. The protein products thus expressed can be used to screen for 
agents/drugs that modulate an activity of CLPPl. In addition, these polynucleotides can be 
used to designing gene therapy vectors which target the expression of the CLPPl gene or an 
activity of CLPP 1 in humans. 

[0078] A polynucleotide comprising SEQ ID NO: 1 or SEQ ID NO:3 can be prepared 
using standard molecular biology techniques as appreciated by one of ordinary skill in the 
art. For instance, primers derived from the 5' and 3' ends of SEQ ID NO:l can be used to 
amplify mRNAs isolated from human tissues. The cDNA thus produced contains SEQ ID 
NO:l. Likewise, primers for amplifying the human genomic sequence containing SEQ ID 
NO:3 can be designed and used to prepare the genomic sequence of the CLPPl gene. A 
variant (such as a homolog) or a fragment of SEQ ID NOT or SEQ ID NO:3 can be 
similarly prepared. Alternatively, probes can be designed to screen for cDNA or genomic 
sequence libraries in order to identify polynucleotide molecules comprising the lull-length 
or fragments of SEQ ID NO:l or SEQ ID NO:3. The molecules thus identified can be used 
to create suitable vectors comprising the full-length SEQ ID NOT or SEQ ID NO:3. 
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[0079] Polynucleotides capable of hybridizing to the CLPPl gene can be prepared by 
standard synthetic techniques, e.g., using an automated DNA synthesizer. Preferably, the 
polynucleotide probes can hybridize to the CLPPl gene under reduced stringent conditions, 
stringent conditions, or highly stringent conditions. In one embodiment, the 
polynucleotides comprise at least 15, 20, 25, 30, 50, 100, 150, 200, 250, 300, 350, 400, 450, 
500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000 or more consecutive nucleotides of 
SEQ ID NO:l. Any fragments of SEQ ID NO: 1 and SEQ ID NO: 3 may be used as 
hybridization probes or PCR primers for the CLPPl gene or its transcripts. The 
probes/primers can be substantially purified. 

[0080] In a preferred embodiment, the hybridization probes for the CLPPl gene 
comprise a label group. The label group can be a radioisotope, a fluorescent compound, an 
enzyme, or an enzyme co-factor. Probes thus labeled can be used as part of a diagnostic kit 
for determining the expression level of the CLPPl gene in human tissues. 
[0081] This invention encompasses human CLPPl gene homologs in other species. 
These homologs can be determined by search different sequence databases, such as the 
Entrez/GenBank sequence databases maintained by the NCBI. The invention also 
encompasses polynucleotide molecules which are structurally different from the molecules 
described above, but have the substantially same properties as the molecules described 
above. Such molecules include allelic variants, which will be described below in greater 
detail. 

[0082] DNA sequence polymorphism in human CLPPl gene exists among different 
individuals due to natural allelic variations. An allele is one of a group of genes which 
occur alternatively at a given genetic locus. DNA polymorphisms that affect the RNA 
expression level of the CLPPl gene can also exist, e.g. through affecting the regulation or 
degradation of expression of the gene. The present invention contemplates all allelic 
variants of human CLPPl gene. Allelic variants and other homologs of the CLPPl gene 
ean be isolated using probes/primers derived from SEQ ID NO:l or SEQ ID NO:3. 
[0083] It should, of course, be understood that SEQ ID NO: 1 and SEQ ID NO:3 can be 
modified. The modified polynucleotides can comprise one or more mutations. These 
mutations can be substitutions, additions or deletions of 1, 2, 3, 5, 10, 15, 20 or more 
nucleotide residues in SEQ ID NO:l or SEQ ID NO:3. Standard techniques can be used, 
such as site-directed mutagenesis or PCR-mediated mutagenesis. Preferably, these 
mutations create conservative amino acid substitutions. Alternatively, mutations can be 
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introduced randomly along all or part of the CLPPl gene or its cDNA, such as by saturation 
mutagenesis. Following mutagenesis, the encoded proteins can be expressed recombinantly 
and their activities can be determined. 

[0084] In one embodiment, nucleotide substitutions leading to amino acid substitutions 
at "non-essential" amino acid residues can be introduced. A "non-essential" amino acid 
residue is a residue that can be altered without changing the biological activity of the 
protein. In contrast, an "essential" amino acid residue is required for the biological activity 
of the protein. Amino acid residues that are conserved among allelic variants or homologs 
of the CLPPl gene from different species preferably are not changed in the present 
invention. 

[0085] Accordingly, another aspect of the invention pertains to CLPPl proteins that 
contain changes in amino acid residues that are not essential for the biological activity of 
CLPPl. These proteins differ in amino acid sequence from the original human CLPPl, but 
retain its biological activity. In one embodiment, the modified protein comprises an amino 
acid sequence at least about 85%, 90%, 95%, 98% or more homologous to SEQ ID NO:2. 
[0086] In another embodiment, CLPPl proteins contain mutations in amino acid 
residues which result in inhibition of CLPPl activity. These mutated CLPPl proteins can 
be used to inhibit CLPPl activity in patients with disorders related to the aberrant 
expression of CLPPl . 

[0087] A polynucleotide of this invention can be further modified to increase its 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
flanking sequences at the 5' and/or 3 1 ends; the use of phosphorothioate or 2-o-methyl rather 
than phosphodiester linkages in the backbone; and/or the inclusion of nontraditional bases 
such as inosine, queosine and wybutosine, as well as acetyl- methyl-, thio- and other 
modified forms of adenine, cytidine, guanine, thymine and uridine. 
[0088] Polynucleotide molecules which are antisense to the CLPP 1 gene can be 
prepared. An "antisense" polynucleotide comprises a nucleotide sequence which is 
complementary to a "sense" polynucleotide which encodes a protein. An antisense 
polynucleotide can bind via hydrogen bonds to the sense polynucleotide. 
[0089] Antisense polynucleotides of the invention can be designed according to the 
rules of Watson and Crick base pairing. The antisense polynucleotide molecule can be 
complementary to the entire coding region or part of the coding region of the CLPPl gene. ^ 
The antisense polynucleotide molecule can also be complementary to a "noncoding region" 
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in the coding strand of the CLPP1 gene. Preferably, the antisense polynucleotide is an 
oligonucleotide which is antisense to only a portion of the CLPP1 gene. An antisense 
polynucleotide can be, for example, about 5, 10, 1 5, 20, 25, 30, 35, 40, 45 or 50 nucleotides 
in length. An antisense polynucleotide of the invention can be constructed using chemical 
synthesis and enzymatic ligation reactions as appreciated by one of ordinary skill in the art. 
For example, an antisense polynucleotide can be chemically synthesized using naturally 
occurring nucleotides or variously modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical stability of the duplex formed between 
the antisense and sense polynucleotides. Examples of modified nucleotides which can be 
used to generate the antisense polynucleotide include 5-fluorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 
5-(carboxyhydroxymethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyl adenosine, unacil-5-oxyacetic acid, wybutoxosine, pseudouracil, queosine, 
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5- 
oxyacetic acid methylester, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 
2,6-diaminopurine. Phosphorothioate derivatives and acridine substituted nucleotides can 
also be used. Alternatively, the antisense polynucleotide can be produced biologically using 
an expression vector into which a polynucleotide has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted polynucleotide will be of an antisense 
orientation to the target polynucleotide of interest). 

[0090] The antisense polynucleotides of the invention can be administered to a subject 
or applied in situ such that they hybridize or bind to cellular mRNAs and/or genomic DNAs 
that encode CLPP1, thereby inhibiting the expression of CLPP1. The hybridization can 
result in a stable duplex via conventional nucleotide complementarity. An example route 
for administering antisense polynucleotides includes direct injection at a tissue site. 
Antisense polynucleotides can also be modified first, and then administered systemically. 
For example, for systemic administration, antisense molecules can be modified such that 
they specifically bind to receptors or antigens expressed on a selected cell surface. Suitable 
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modifications include linking the antisense polynucleotides to peptides or antibodies which 
bind to the cell surface receptors or antigens. In addition, the antisense polynucleotides can 
be delivered to cells using vectors. To achieve sufficient intracellular concentrations of the 
antisense molecules, strong pol II or pol III promoters may be used in the vectors. 
[0091] In one embodiment, the antisense polynucleotides are a-anomeric 
polynucleotides. An a-anomeric polynucleotide molecule forms specific double-stranded 
hybrid with a complementary RNA in which, contrary to the usual p-units, the strands run 
parallel to each other. The antisense polynucleotide molecule can also comprise a 
2-o-methylribonucleotide or a chimeric RNA-DNA analogue. 

[0092] In another embodiment, the antisense polynucleotide is a ribozyme. Ribozymes 
are catalytic RNA molecules with ribonuclease activity which are capable of cleaving a 
single-stranded polynucleotide, such as an mRNA, to which they have a complementary 
region. Thus, ribozymes {e.g., hammerhead ribozymes described in Haselhoif and Gerlach 
Nature 334:585-591, 1988) can be used to catalytically cleave mRNA transcripts of CLPP1 
in order to inhibit its expression. A ribozyme having specificity for the CLPP1 gene or its 
transcripts can be designed based upon SEQ ID NO:l or 3. mRNAs transcribed from the 
CLPP1 gene can be used to select from a pool of RNA molecules a catalytic RNA having a 
specific ribonuclease activity. 

[0093] Alternatively, the expression of the CLPP 1 gene can be inhibited by using 
nucleotide sequences complementary to the regulatory region {e.g., the promoter and/or 
enhancers). These nucleotide sequences can form triple helical structures that prevent 
transcription of the gene in the target cells. 

[0094] Expression of the CLPP1 gene can also be inhibited using RNA interference 
("RNAi"). RNAi is a phenomenon in which the introduction of double-stranded RNA 
(dsRNA) into certain organisms or cell types causes degradation of the homologous mRNA. 
First discovered in the nematode Caenorhabditis elegans, RNAi has since been found to 
operate in a wide range of organisms. For example, in mammalian cells, introduction of 
long dsRNA (>30 nucleotides) can initiate a potent antiviral response, exemplified by 
nonspecific inhibition of protein synthesis and RNA degradation. RNA interference 
provides a mechanism of gene silencing at the mRNA level. In recent years, RNAi has 
become an endogenous and potent gene-specific silencing technique that uses double- 
stranded RNAs (dsRNA) to mark a particular transcript for degradation in vivo. It also 
offers an efficient and broadly applicable approach for gene knock-out. In addition, RNAi 
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technology can be used for therapeutic purposes. For example, RNAi targeting Fas- 
mediated apoptosis has been shown to protect mice from fulminant hepatitis. RNAi 
technology has been disclosed in numerous publications, such as U.S. Patent Nos. 
5,919,619, 6,506,559 and PCT Publication Nos. W099/14346, WO01/70949, 
WO01/36646, WO00/63364, WO00/44895, WO01/75164, WO01/92513, WO01/68836 and 
WO01/29058. 

[0095] A sequence capable of inhibiting gene expression by RNA interference can 
have any length. For instance, the sequence can have at least 10, 15, 20, 25, 30, 35, 40, 45, 
50, 100, or more consecutive nucleotides. The sequence can be dsRNA or any other type of 
polynucleotide, provided that the sequence can form a functional silencing complex to 
degrade the target mRNA transcript. 

[0096] In one embodiment, the sequence comprises or consists of a short interfering 
RNA (siRNA). The siRNA can be dsRN A having 19-25 nucleotides. siRNAscanbe 
produced endogenously by degradation of longer dsRNA molecules by an RNase Ill-related 
nuclease called Dicer. siRNAs can also be introduced into a cell exogenously or by 
transcription of an expression construct. Once formed, the siRNAs assemble with protein 
components into endoribonuclease-containing complexes known as RNA-induced silencing 
complexes (RISCs). An ATP-generated unwinding of the siRNA activates the RISCs, 
which in turn target the complementary mRNA transcript by Watson-Crick base-pairing, 
thereby cleaving and destroying the mRNA. Cleavage of the mRNA takes place near the 
middle of the region bound by the siRNA strand. This sequence-specific mRNA 
degradation results in gene silencing. 

[0097] At least two ways can be employed to achieve siRNA-mediated gene silencing. 
First, siRNAs can be synthesized in vitro and introduced into cells to transiently suppress 
gene expression. Synthetic siRNA provides an easy and efficient way to achieve RNAi. 
siRNA are duplexes of short mixed oligonucleotides which can include, for example, 19 
nucleotides with symmetric dinucleotide 3' overhangs. Using synthetic 21 bp siRNA 
duplexes (e.g., 19 RNA bases followed by a UU or dTdT 3'overhang), sequence-specific 
gene silencing can be achieved in mammalian cells. These siRNAs can specifically 
suppress targeted gene translation in mammalian cells without activation of DNA- 
dependent protein kinase (PKR) by longer dsRNA, which may result in non-specific 
repression of translation of many proteins. 
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[0098] Second, siRNAs can be expressed in vivo from vectors. This approach can be 
used to stably express siRNAs in cells or transgenic animals. In one embodiment, siRNA 
expression vectors are engineered to drive siRNA transcription from polymerase III (pol III) 
transcription units. Pol III transcription units are suitable for hairpin siRNA expression, 
since they deploy a short AT rich transcription termination site that leads to the addition of 
2 bp overhangs (e.g., UU) to hairpin siRNAs - a feature that is helpful for siRNA function. 
The Pol III expression vectors can also be used to create transgenic mice that express 
siRNA. 

[0099] In another embodiment, siRNAs can be expressed in a tissue-specific manner. 
Under this approach, long double-stranded RNAs (dsRNAs) are first expressed from a 
promoter (such as CMV (pol II)) in the nuclei of selected cell lines or transgenic mice. The 
long dsRNAs are processed into siRNAs in the nuclei (e.g., by Dicer). The siRNAs exit 
from the nuclei and mediate gene-specific silencing. A similar approach can be used in 
conjunction with tissue-specific promoters to create tissue-specific knockdown mice. 
[0100] Any 3' dinucleotide overhang, such as UU, can be used for siRNA design. In 
some cases, G residues in the overhang are avoided because of the potential for the siRNA 
to be cleaved by RNase at single-stranded G residues. 

[0101] With regard to the siRNA sequence itself, it has been found that siRNAs with 
30-50% GC content can be more active than those with a higher G/C content in certain 
cases. Moreover, since a 4-6 nucleotide poly(T) tract may act as a termination signal for 
RNA pol III, stretches of > 4 Ts or As in the target sequence may be avoided in certain 
cases when designing sequences to be expressed from an RNA pol III promoter. In 
addition, some regions of mRNA may be either highly structured or bound by regulatory 
proteins. Thus, it may be helpful to select siRNA target sites at different positions along the 
length of the gene sequence. Finally, the potential target sites can be compared to the 
appropriate genome database (human, mouse, rat, etc.). Any target sequences with more 
than 16-17 contiguous base pairs of homology to other coding sequences may be eliminated 
from consideration in certain cases. 

[0102] In one embodiment, siRNA can be designed to have two inverted repeats 
separated by a short spacer sequence and end with a string of Ts that serve as a transcription 
termination site. This design produces an RNA transcript that is predicted to fold into a 
short hairpin siRNA. The selection of siRNA target sequence, the length of the inverted 
repeats that encode the stem of a putative hairpin, the order of the inverted repeats, the 
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,ength and composition of the spacer sequence that encodes the loop of the hairpin, and the 
presence or absence of 5 '-overhangs, can vary to achieve desirable results. 
[0103] The siRNA targets can be selected by scanning an mRNA sequence for AA 
dinucleotides and recording the 19 nucleotides immediately downstream of the AA. Other 
methods can also been used to select the siRNA targets. In one example, the selectxon of 
the siRNA target sequence is purely empirically determined (see e.g., Sui et al., Proc. Natl. 
Acad. Sci. USA 99: 5515-5520, 2002), as long as the target sequence starts with GG and 
does not share significant sequence homology with other genes as analyzed by BLAST 
search. In another example, a more elaborate method is employed to select the siRNA 
target sequences. This procedure exploits an observation that any accessible site in 
endogenous mRNA can be targeted for degradation by synthetic oligodeoxyribonucleotide 
/RNase H method (Lee et al., Nature Biotechnology 20:500-505, 2002). 
[0104] In another embodiment, the hairpin siRNA expression cassette is constructed to 
contain the sense strand of the target, followed by a short spacer, the antisense strand of the 
target and 5-6 Ts as transcription terminator. The order of the sense and antisense strands 
within the siRNA expression constructs can be altered without affecting the gene silencing 
activities of the hairpin siRNA. In certain instances, the reversal of the order may cause 
partial reduction in gene silencing activities. 

[0105] The length of nucleotide sequence being used as the stem of siRNA expression 
cassette can range, for instance, from 19 to 29. The loop size can range from 3 to 23 
nucleotides. Other lengths and/or loop sizes can also be used. 

[0106] In yet another embodiment, a 5' overhang in the hairpin siRNA construct can be 
used, provided that the hairpin siRNA is functional in gene silencing. In one specific 
example, the 5' overhang includes about 6 nucleotide residues. 

[0107] In still yet another embodiment, the target sequence for RNAi is a 21-mer 
sequence fragment selected from SEQ ID NO:l . The 5' end of the target sequence has 
dinucleotide "NA," where «N" can be any base and "A" represents adenine. The remaining 
19-mer sequence has a GC content of between 35% and 55%. In addition, the remaimng 
19-mer sequence does not include any four consecutive A or T (i.e., AAAA or TTTT), three 
consecutive G or C (i.e., GGG or CCC), or seven "GC" in a role. Exemplary RNAi target 
sequences identified according to the above-described criteria ("relaxed" criteria) are 
illustrated in Table 3. The siRNA sequences for each target sequence (listed in the row as 
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the target sequence and including the sense strand and the antisense strand) are also 
indicated in Table 3. 

[0108] Additional criteria can also be used for RNAi target sequence design. For 
instance, the GC content of the remaining 19-mer sequence can be limited to between 45% 
and 55%. Moreover, any 19-mer sequence having three consecutive identical bases (i.e., 
GGG, CCC, TTT, or AAA) or a palindrome sequence with 5 or more bases is excluded. 
Furthermore, the remaining 19-mer sequence can be selected to have low sequence 
homology to other human genes. In one specific example, potential target sequences are 
searched by BLASTN against NCBI's human UniGene cluster sequence database. The 
human UniGene database contains non-redundant sets of gene-oriented clusters. Each 
UniGene cluster includes sequences that represent a unique gene. 19-mer sequences 
producing no hit to other human genes under the BLASTN search can be selected. During 
the search, the e-value may be set at a stringent value (such as "1"). Exemplary target 
sequences derived using these additional conditions ("stringent" criteria) are shown in Table 
4 The siRNA sequences for each target sequence (listed in the same row as the target 
sequence and including the sense strand and the antisense strand) are also indicated in Table 
4. 

[0109] The effectiveness of the siRNA sequences listed in Tables 3 and 4, as well as 
any other RNAi sequence derived according to the present invention, can be evaluated using 
various methods known in the art. For instance, a siRNA sequence of the present invention 
can be introduced into a cell that expresses the CLPP1 gene. The polypeptide or mRNA 
level of the CLPP1 gene in the cell can be detected. A substantial change in the expression 
level of the CLPP1 gene before and after the introduction of the siRNA sequence is 
indicative of the effectiveness of the siRNA sequence in suppressing the expression of the 
CLPP1 gene. In one specific example, the expression levels of other genes are also 
monitored before and after the introduction of the siRNA sequence. A siRNA sequence 
which has inhibitory effect on CLPP1 gene expression but does not significantly affect the 
expression of other genes can be selected. In another specific example, multiple siRNA or 
other RNAi sequences can be introduced into the same target cell. These siRNA or RNAi 
sequences specifically inhibit CLPP1 gene expression but not the expression of other genes. 
In yet another specific example, siRNA or other RNAi sequences that inhibit the expression 
of both the CLPP1 gene and other gene or genes can be used. 
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Tahiti. Exempt 



BN Ai Target Seflugnces in the C I gPj fiene and the Corresponding 
<iiRNAs (LMpt Relaxed Criteria) 



Target Sequence 


cinMi Cpnce Strand 


siRNA Antisense Strand 


SEQ ID NO:4 


cpf) m NO'5 


SEQ ID NO:6 


SEQ ID NO:7 


ccn Tn NCV8 


SEQ ID NO:9 


SEQ ID NO: 10 




SEQ ID NO: 12 


SEQ ID NO: 13 


OCA 1T\ TyJO' 1 4 


SEQ ID NO: 15 


SEQ ID NO: 16 


oca TT^ XT A • 1 *7 


SEQ ID NO: 18 


| SEQ ID NO: 19 


oca TFI MO'90 


SEQ ID NO:21 


SEQ ID NO:22 


CCA TF> 

O.EA^ AA ^ L\vJ.<t-J 


SEQ IDNO:24 


| SEQ ID NO:25 


CCA TF\ XirV96 


SEQ ID NO:27 


| SEQ ID NO:28 


oca TFI XTO-90 


SEQ ID NO:30 


| SEQIDNO-.31 


CCA TFI TsJO'^9 


SEQ ID NO: 33 


[~ SEQ ID NO: 34 


cca TP* xjrv^s 


SEQ ID NO:36 


1 SEQ ID NO:37 


CCA TT^ XTO-^R 


SEQ ID NO:39 


| SEQ ID NO:40 


SEQ ID NO:4i 


SEO ID NO:42 


| SEQ ID NO:43 


cca TF* XFfV44. 
b-bv<? AA -* 


SEQ ID NO:45 

****** >^ 


1 SEQ ID NO:46 


CCA TF\ XT O- id 7 


SEQ ID NO:48 


1 SEQ ID NO:49 


CCA TFl ATA*S0 


SEQ ID NO:51 


\~ SEQ ID NO:52 


CCA TF* >JO*S^ 


SEQ ID NO:54 


1 SEQ ID NO:55 


CCA TF\ XJO'S6 


SEQ ID NO:57 


| SEQ ID NO:58 


oca TF> >JO*SQ 


SEQ ID NO:60 


| SEQIDNO:61 


qca Tr\ MO -69 


SEQ ID NO:63 


1 SEQ ID NO:64 


cca TFl XJfVfiS 


SEQ ID NO:66 


j SEQ ID NO:67 




SEQ ID NO:69 


SEQ ID NO:70 


SEQ ID NO:71 


SEQ ID NO:72 


SEQ ID NO:73 


SEQ ID NO:74 


SEQ ID NO:75 


SEQ ID NO:76 


SEQ ID NO:77 


SEQ ID NO:78 


SEQ ID NO:79 


SEQ ID NO:80 


SEQ ID NO:81 


SEQ ID NO:82 


SEQ ID NO:83 


SEQ ID NO:84 


SEQ ID NO: 85 


SEQ ID NO:86 


SEQ ID NO:87 
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Target Sequence 


SliviNA oenSc oiraiiu 


siRNA Antisense Strand 


SEQ ID NO:88 


CCA TF> XTfVRQ 

SEQ ID r\\J.oy 


SEQ ID NO:90 


SEQ ID NO:91 


cca ta XTO'Q9 
bEQ ID INvJ.^Z 


SEQ ID NO:93 


SEQ ID NO:94 


oca tt^ XJfVQS 


SEQ ID NO:96 


SEQ ID NO:97 


SEQ ID anu.^o 


SEQ ID NO:99 


SEQ ID NO: 100 


b EQ ID IN U . 1 U 1 


SEQ ID NO: 102 


[ SEQ ID NO: 103 


ex? A T A XTO* 1 OA 
SEQ ID ANw.lUH 


SEQ ID NO:105 


| SEQ ID NO: 106 


oca in XT A* 1 (Yl 

SEQ ID JNU.iu/ 


SEQ ID NO: 108 


1 SEQ ID NO: 109 


prA TT\ XT A • 1 1 H 

SEQ ID INU. liu 


SEQ ID NO:lll 


J SEQIDNO-.112 


SEQ ID INU.l 13 


SEQ ID NO: 114 


| SEQIDNO:115 


T-« f\ TT\ XT A • 1 1 A 

SEQ ID JNU.l lo 


SEQ ID NO: 117 


\ SEQIDNO:118 


SEQ ID JNU.l IV 


SEQ ID NO: 120 


| SEQIDNO:121 


n-pA TT\ XT A* 1 OO 

SEQ ID JNU.l zz 


SEQ ID NO: 123 


1 SEQ ID NO: 124 


nc/"*\ TTX XT A • 1 O ^ 

SEQ ID NU.Izj 


SEQ ID NO: 126 


| SEQ ID NO :127 


OCA TCi XT A • 1 

SEQ ID NU.lzo 


SEQ ID NO:129 


1 SEQ ID NO: 130 


SEQ ID JNU. 131 


SEQ ID NO: 132 


| SEQ ID NO: 133 


SEQ ID NU.l 34 


SEQ ID NO: 135 


| SEQ ID NO: 136 


oc/~v ir\ XT A • 1 *X7 

SEQ ID NU.l 3 / 


SEQIDNO:138 


1 SEQ ID NO: 139 


o TCi xt/^\ • 1 /in 
SEQ ID JNU.14U 


SEQIDNO:141 


|~ SEQ ID NO: 142 


oca T A XT A • 1 A1 

SEQ ID JNU.14:* 


SEQ ID NO:144 


| SEQ ID NO: 145 


oca TC\ XT A» 1 Af\ 

SEQ ID INU.l *rO 


SEQ ID NO: 147 


1 SEQ ID NO: 148 


oc/~\ tc\ XTA« 1 A Q 

SEQ ID JNU.14V 


SEQ ID NO: 150 


| SEQIDNO:151 


SEQ ID NO: 152 


cpo id NO* 153 


| SEQ ID NO: 154 


OCA TCi XT A • 1 ^ ^ 

SEQ ID JNU. 1 00 


SEQ ID NO: 156 


p SEQ ID NO: 157 


OCA T A XT A- 1 ^ Q 

SEQ ID JNU.lJo 


SEQ ID NO: 159 


|~ SEQ ID NO: 160 


OCA T A XT A • 1 f\ 1 

SEQ ID JNU. loi 


^ SEQ ID NO: 162 


1 SEQ ID NO: 163 


OCA T A XT A • 1 f\A. 

SEQ ID JNU.lo*t 


SEQ ID NO: 165 


| SEQ ID NO: 166 


SEQ ID NO: 167 


SEQ ID NO: 168 


1 SEQ ID NO: 169 


SEQ ID NO: 170 


SEQ ID NO: 171 


I SEQ ID NO: 172 


SEQ ID NO: 173 


SEQ ID NO: 174 


| SEQ ID NO: 175 


SEQ ID NO:176 


SEQ ID NO: 177 
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[ Target Sequence 


• D XT A Con c p QffJUld ' 


siRNA Antisense Strand 


|~ SEQIDNO:178 


oua vr\ isJO* 1 79 


SEQ ID NO: 180 


| SEQIDN0:181 




SEQ ID NO: 183 


1 SEQ ID NO: 184 


oc/"^ TT\ XT A ♦ 1 

SEQ ID MU.loD 


SEQ ID NO: 186 


| SEQ ID NO: 187 


crn TFi MO* 1 RR 


SEQ ID NO:189 


1 SEQIDNO:190 


ccn m NOl 91 


SEQ ID NO: 192 


P SEQIDNO:193 


b EQ ID V-i . 1 7t 


SEQ ID NO: 195 


\ SEQIDNO:196 


oca tf\ XTO - 1 07 
SEQ *U 


SEQIDNO:198 


\ SEQIDNO:199 


SEQ ID JNVJ.ZUU 


SEQ ID NO:201 


[ SEQ ID NO.202 


SEQ ID INvJ.ZUj 


SEQ ID NO:204 | 


| SEQ ID NO:205 


OCA T A XTA-OHA 

SEQ ID INvJ.zuo 


SEQ ID NO:207 J 


[ SEQ ID NO.208 


oca ta \Tnono 
SEQ ID JNvJ.zuy 


SEQ ID NO:210 J 


| SEQIDNO:211 


orA ta XTA«0 1 0 

SEQ ID INU./iz 


SEQ ID NO:213 J 


1 SEQIDNO.214 


ocr\ TT\ XTA«01 *n 

SEQ ID JNiJ.zi^ 


SEQIDNO:216 


| SEQIDNO-.217 


orrrv t A \1A • 0 1 S 

SEQ ID JNU.Zlo 


SEQIDNO:219 


| SEQ ID NO.220 


OCi"\ T A XT A '001 

SEQ ID inu.zzi 


SEQ ID NO:222 


\ SEQ ID NO:223 


OCA T A XT A O 0 A 

SEQ ID jNvJ.zz*^ 


SEQ ID NO:225 


[ SEQ ID NO:226 


oca TA XTA-997 

SEQ ID INU.zz / 


SEQ ID NO:228 


1 SEQ ID NO:229 


OCA T A XTAO'XO 

SEQ ID JN*J./3U 


SEQ ID NO:231 


\ SEQ ID NO:232 


ocA TP* XTO-9^^ 

SEQ ID \S\J.£3j 


SEQ ID NO:234 


[ SEQIDNO-.235 


oca TA XTAO^^ 

SEQ ID 


SEQ ID NO:237 J 


| SEQ ID NO:238 


oca in xjo-9^9 


SEQ ID NO:240 _| 


1 SEQ ID NO:241 


ocA TA XT A -949 

SEQ ID JNvJ.zhz 


SEQ ID NO:243 


| SEQ ID NO:244 


oca TO, MfV94S 


SEQ ID NO:246 | 


SEQIDNO:247 


oca TA XT A • 9 4 8 
SEQ ID INvJ.zho 


SEQ ID NO:249 


SEQ ID NO:250 


ocA TA XT f~V 9 S 1 

SEQ ID in^.zji 


SEQ ID NO:252 


SEQ ID NO:253 


ocA TA XTA'9*S4 

SEQ ID IN^.zjh 


SEQ ID NO:255 


SEQ ID NO:256 


SEQ ID NO:257 


SEQ ID NO:258 


SEQ ID NO-.259 


SEQ ID NO.260 


SEQ ID NO:261 


SEQ ID NO: 262 


SEQ ID NO:263 


SEQ ID NO:264 


SEQ ID NO:265 


SEQ ID NO-.266 


SEQ ID NO:267 
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Target Sequence 


siRNA Sense Strand 


siRNA Antisense Strand 


SEQ ID NO:268 


SEQ ID NO:269 


SEQ ID NO:270 



Table 4 F.xemnlar v RNAj TWer Sequences in the CT,PP1 Gene and the, Corresponding 

siRNAs (LMex Stringent Criteria) 



Target Sequence 



SEQ ID NO:271 



SEQ ID NO:274 



SEQ ID NO:277 



SEQ ID NO.280 



SEQ ID NO:283 



SEQ ID NO: 286 



SEQ ID NO:289 



SEQ ID NO:292 



SEQ ID NO :295 



SEQ ID NO:298 



SEQ ID NO-.301 



siRNA Sense Strand 



SEQ ID NO:272 



siRNA Antisense Strand 



SEQ ID NO:273 



SEQ ID NO:275 



SEQ ID NO:278 



SEQ ID NO:281 



SEQ ID NO:284 



SEQ ID NO:287 



SEQ ID NO:276 



SEQ ID NO:279 



SEQ ID NO:282 



SEQ ID NO:285 



SEQ ID NO-.288 



SEQ ID NO-.290 



SEQ ID NO:293 



SEQIDNO:291 



SEQIDNO:294 



SEQ ID NO:296 



SEQ ID NO:299 



SEQ ID NO:297 



SEQ ID NO:300 



SEQ ID NO:303 



SEQIDNO:302 

[0110] In yet another embodiment, the polynucleotides of the present invention can be 
modified at the base moiety, sugar moiety or phosphate backbone to improve the stability, 
hybridization, or solubility of the molecules. For instance, the deoxyribose phosphate 
backbone of the polynucleotide molecules can be modified to generate peptide 
polynucleotides (see Hyrup et al, Bioorganic & Medicinal Chemistry 4:523, 1996). As 
used herein, the terms "peptide polynucleotides" or "PNAs" refer to polynucleotide mimics, 
e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength. PNA oligomers can be synthesized using standard 
solid phase peptide synthesis protocols. 

[0111] PNAs can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense agents for sequence-specific modulation of the CLPP1 gene 
expression. PNAs can also be used in the analysis of single base pair mutations in a gene, 
(e.g., by PNA-directed PCR clamping); as artificial restriction enzymes when used in 
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combination with other enzymes, {e.g., SI nucleases); or as probes or primers for DNA 
sequencing or hybridization. 

[0112] In one embodiment, PNAs can be modified to enhance their stability or cellular 
uptake by attaching lipophilic or other helper groups to PNA, by the formation of PNA- 
DNA chimeras, or by the use of liposomes or other drug delivery techniques known in the 
art. For example, PNA-DNA chimeras of the polynucleotides of the invention can be 
generated. These chimeras allow DNA recognition enzymes, such as RNase H and DNA 
polymerases, to interact with the DNA portion while the PNA portion provides high binding 
affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate 
lengths which are selected based on base stacking, number of bonds between the 
nucleobases, and orientations. The PNA-DNA chimeras can be synthesized as follows. A 
DNA chain is synthesized on a solid support using standard phosphoramidite coupling 
chemistry and modified nucleoside analogs. PNA monomers are then coupled in a stepwise 
manner to produce a chimeric molecule with a 5 ' PNA segment and a 3 ' DNA segment. 
Alternatively, chimeric molecules can be synthesized with a 5 'DNA segment and a 3'PNA 
segment. 

[0113] In other embodiments, the polynucleotides of this invention may include other 
appended groups such as peptides {e.g., for targeting host cell receptors in vivo), or agents 
facilitating transportation across the cell membrane or the blood-kidney barrier {see e.g., 
PCT Publication No. W089/10134). In addition, polynucleotides can be modified using 
hybridization-triggered cleavage agents or intercalating agents. To this end, the 
polynucleotides can be conjugated to another molecule {e.g., a peptide, hybridization 
triggered cross-linking agent, transport agent, or hybridization-triggered cleavage agent). 
Furthermore, the polynucleotide can be detectably labeled. 

Polype ptides and Variants Thereof 

[0114] Several aspects of the invention pertain to isolated CLPP1 polypeptides and 
mutated CLPP1 polypeptides capable of inhibiting normal CLPP1 activity. The present 
invention also contemplates immunogenic polypeptide fragments suitable for raising anti- 
CLPP1 antibodies. 

[0115] In one embodiment, native CLPP1 polypeptides can be isolated from cells or 
tissue sources by using standard protein purification techniques. Standard purification 
methods include electrophoresis, molecular, immunological and chromatographic 
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techniques. Specific examples include ion exchange, hydrophobic, affinity or reverse-phase 
HPLC chromatography, and chromatofocusing. In one embodiment, CLPP1 polypeptides 
are purified using a standard affinity column coupled with anti-CLPPl antibodies. 
Ultrafiltration and diafiltration techniques can also be used. The degree of purification 
depends on the purpose of the use of the CLPP1 polypeptides. In some instances, 
purification is not necessary. 

[0116] In another embodiment, CLPP1 polypeptides or mutated CLPP1 polypeptides 
capable of inhibiting normal CLPP1 activity are produced by recombinant DNA techniques. 
Alternative to recombinant expression, CLPP1 polypeptides or mutated CLPP1 
polypeptides can be synthesized chemically using standard peptide synthesis techniques. 
[0117] The invention provides CLPP1 polypeptides encoded by the human CLPP1 
gene, or homologs thereof. The polypeptides of this invention can be substantially 
homologous to human CLPP1 (SEQ ID NO:2). Preferably, these polypeptides retain the 
biological activity of the native CLPP1 . In one embodiment, the polypeptides comprise an 
amino acid sequence which is at least about 85% 90%, 95%, 98% or more homologous to 
SEQ ID NO:2. 

[0118] Comparison of sequences and determination of percent identity between two 
sequences can be accomplished using a mathematical algorithm. The percent identity 
between two amino acid sequences can be determined using the Needleman and Wunsch (J. 
Mol. Biol. 48:444-453, 1970) algorithm, or the GAP program in the GCG software package 
which uses either a Blossom 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 
10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. The percent identity between two 
nucleotide sequences can be determined using the GAP program in the GCG software 
package, which uses a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 
and a length weight of 1, 2, 3, 4, 5, or 6. The percent identity between two amino acid or 
nucleotide sequences can also be determined using the algorithm of E. Meyers and W. 
Miller (CABIOS, 4:1 1-17, 1989) which has been incorporated into the ALIGN program 
(version 2.0), or the pairwise BLAST program available at NCBI's BLAST web site. 
[0119] The polypeptide and polynucleotide sequences of the present invention can be 
used as query sequences for searching public databases in order to identify similar 
sequences. The search can be conducted using BLAST programs, such as the protein 
BLAST, nucleotide BLAST, pairwise BLAST, and genomic BLAST, that are available at 
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the BLAST web site maintained by the NCBI. When using BLAST programs, the default 
parameters of the respective programs can also be used. 

[0120] The invention further provides chimeric or fusion CLPP1 polypeptides. A 
fusion CLPP1 polypeptide contains an CLPPl-related polypeptide and a non-CLPPl 
polypeptide. The CLPPl-related polypeptides include all or a portion of SEQ ID NO:2 or 
its variant. A peptide linker sequence can be employed to separate the CLPPl-related 
polypeptide from the non-CLPPl polypeptide components by a distance sufficient to ensure 
that each polypeptide folds into its native secondary and tertiary structures. Such a peptide 
linker sequence is incorporated into the fusion protein using standard techniques well 
known in the art. Suitable peptide linker sequences can be chosen based on the followmg 
factors: (1) their ability to adopt a flexible extended conformation; (2) their inability to 
adopt a secondary structure that could interact with functional epitopes on the CLPPl- 
related polypeptide and non-CLPPl polypeptide; and (3) the lack of hydrophobic or charged 
residues that might react with the polypeptide functional epitopes. Preferred peptide hnker 
sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, such as Thr 
and Ala can also be used in the linker sequence. Amino acid sequences suitable as hnkers 
include those disclosed in Maratea et al, Gene 40:39-46, 1985; Murphy et al., Proc. Natl. 
Acad Sci USA 83:8258-8262, 1986; and U.S. Pat. Nos. 4,935,233 and 4,751,180. The 
linker sequences may be from 1 to about 50 amino acids in length. Linker sequences are 
not required when the CLPPl-related polypeptide or the non-CLPPl polypeptide has non- 
essential N-terminal amino acid regions that can be used to separate the respective 
functional domains and thereby prevent steric interference. 

[0 121] In one embodiment, the fusion protein is a GST-CLPP1 fusion protein in winch 
an CLPPl-related sequence, such as SEQ ID NO:2, is fused to the C-terminus of the GST 
sequence. This fusion protein can facilitate the purification of the recombinant CLPP1. 
[0122] The CLPP1 -fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject. The CLPPl-fusion proteins can 
be used to affect the bioavailability of an CLPP1 substrate. The CLPPl-fusion proteins can 
also be used for the treatment or prevention of damages caused by (i) aberrant modification 
or mutation of CLPP1, or (ii) aberrant post-translational modification of CLPP1. It is also 
eonceivable that a fusion protein containing a normal or mutated CLPP1 polypeptide, or a 
fragment thereof, can be used to inhibit CLPP1 activity in a human subject. 
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[0123] Moreover, the CLPPl-fusion proteins can be used as immunogens to produce 
anti-CLPPl antibodies. They can also be used to purify CLPP1 ligands and to screen for 
molecules capable of inhibiting the interaction between CLPP1 and its substrates. 
[0124] Preferably, the CLPP1 -chimeric or fusion proteins of the invention are produced 
using standard recombinant DNA techniques. Commercially available expression vectors 
which encode a fusion moiety (e.g., a GST polypeptide) can be used. 
[0125] A signal sequence can be used to facilitate secretion and isolation of the 
secreted protein or other proteins of interest. Signal sequences are typically characterized 
by a core of hydrophobic amino acids which are generally cleaved from the mature protem. 
Such signal peptides contain processing sites that allow cleavage of the signal sequence 
from the mature proteins as they pass through the secretory pathway. The present invention 
encompasses CLPP1 polypeptides having a signal sequence, or the polynucleotide 
sequences encoding the same. 

[0126] The present invention also pertains to CLPP1 mutants which function as 
antagonists toCLPPl. In one embodiment, antagonists of CLPP1 are used as therapeutic 
agents. For example, a mutant ofCLPPl that forms a non-functional dimer with a wide- 
type CLPP1 (the so-called dominant negative mutant) can decrease the activity ofCLPPl 
and may ameliorate diseases in a subject wherein CLPP1 are abnormally increased m level 
or activity. Dominant negative CLPP1 mutants can be generated by mutagenesis, as 
appreciated by one skilled in the art. 

[0127] CLPP1 mutants which function as either CLPP1 agonists or antagonists can be 
identified by screening combinatorial libraries of mutants. A variegated library of CLPP1 
mutants can be produced by, for example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a degenerate set of potential CLPP1 
sequences is expressible as individual polypeptides, or alternatively, as a set of larger fusion 
proteins containing the set ofCLPPl sequences therein. There are a variety of methods 
which can be used to produce libraries of potential CLPP1 mutants from a degenerate 
oligonucleotide sequence. A degenerate gene sequence can be chemically synthesized 
using an automatic DNA synthesizer. The synthetic gene can then be ligated into an 
appropriate expression vector. 

[0128] In one embodiment, a library of coding sequences can be generated using 
nuclease, For instance, double stranded PCR fragments of the CLPP1 coding sequence can 
be treated by a nuclease which produces about one nick per molecule. The double-stranded 



NVA279520 



34 



DNAs then are subject to a cycle of denaturing and re-naturing. The newly reformed 
DNAs, which may include sense/antisense pairs from different nicked products, are treated 
with SI nuclease to remove single stranded portions. Using this method, an expression 
library which encodes N-terminal, C-terminal or internal fragments of CLPPl can be 
derived. 

[0129] In addition, recursive ensemble mutagenesis (REM), a technique which 
enhances the frequency of functional mutants in the libraries, can be used to prepare CLPPl 
mutants (Delgrave et al, Protein Engineering 6:327-331, 1993). 

[0130] CLPPl fragments, or variants thereof, can also be generated using synthetic 
means, such as solid-phase synthesis methods. Preferably, the synthesized fragment has 
less than about 100 amino acids, or preferably, less than about 50 amino acids, 
fragments, or variants thereof, can also be generated using synthetic means, such as solid- 
phase synthesis methods. Preferably, the synthesized fragment has less than about 100 
amino acids, or preferably, less than about 50 amino acids. 

Antibodies 

[0131] In accordance with another aspect of the present invention, antibodies specific 
to CLPPl or its variants are prepared. An antibody is considered to bind "specifically" to an 
antigen if the binding affinity between the antibody and the antigen is equal to, or greater 
than 10 s M" 1 . The antibodies can be monoclonal or polyclonal. Preferably, the antibodies 
are monoclonal. More preferably, the antibodies are humanized antibodies. 
[0132] Polyclonal anti-CLPPl antibodies can be prepared by immunizing a suitable 
subject with CLPPl or fragments thereof. The anti-CLPPl antibody titer in the immunized 
subject can be monitored over the time using standard techniques, such as ELISA. The anti- 
CLPPl antibody can be isolated from the immunized subject using well known techniques. 
[0133] In one embodiment, hybridomas capable of producing anti-CLPPl antibodies 
are prepared. Purified CLPPl or its variants, or fragments thereof, are used to immunize a 
vertebrate, such as a mammal. Suitable mammals include mice, rabbits and sheep. 
Preferably, the fragment used for immunization comprises at least 8 amino acid residues, 
more preferably at least 12 amino acid residues, highly preferably at least 16 amino acid 
residues, and most preferably at least 20 amino acid residues. 

[0134] Immunogenic fragments (epitopes) of CLPPl can be identified using well 
known techniques. In general, any fragment of SEQ ID NO:2 can be used to raise 
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antibodies specific to CLPP1. Preferred epitopes are regions that are located on the surface 
of CLPP1 . These regions are usually hydrophilic. 

[0135] Splenocytes are isolated from the immunized vertebrate and fused with an 
immortalized cell line (such as a myeloma) to form hybridomas. Preferably, the immortal 
cell line is derived from the same mammalian species as the lymphocytes. For example, 
murine hybridomas can be made by fusing an immortalized mouse cell line with 
lymphocytes isolated from a mouse that is immunized with an immunogenic preparation of 
the present invention. Preferred immortalized cell lines include mouse myeloma cell lines 
that are sensitive to culture medium containing hypoxanthine, aminopterin and thymidine 
("HAT medium"). Suitable myeloma cell lines include, but are not limited to, the 
P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp210-Agl4 myeloma lines, all of which are available 
from ATCC. In one embodiment, HAT-sensitive mouse myeloma cells are fused to mouse 
splenocytes using polyethylene glycol ("PEG"). Hybridoma cells thus produced are 
selected against HAT medium, which kills unfused or unproductively fused myeloma cells. 
Hybridoma cells which produce monoclonal anti-CLPPl antibodies are then detected by 
screening the hybridoma culture supernatants. 

[0136] A monoclonal anti-CLPPl antibody can also be prepared by screening a 
recombinant combinatorial immunoglobulin library {e.g., an antibody phase display library). 
Kits for generating and screening phage display libraries are commercially available (e.g., 
the Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene SurfZAP™ Phage Display Kit, Catalog No. 240612). 

[0137] The anti-CLPPl antibodies of the present invention also include "single-chain 
Fv" or "scFv." The scFv fragments comprise the V H and V L domains of an antibody. 
Generally, the scFv polypeptide further comprises a polypeptide linker between the V H and 
V L domains. The polypeptide linker enables the scFv to form the desired structure for 
antigen binding. Additionally, recombinant anti-CLPPl antibodies, such as chimeric and 
humanized monoclonal antibodies, can be prepared, as appreciated by one of ordinary skill 
in the art. 

[0138] Humanized antibodies are particularly desirable for therapeutic treatment of 
human subjects. Humanized forms of non-human (e.g., murine) antibodies are chimeric 
immunoglobulins, immunoglobulin chains, or fragments thereof (such as Fv, Fab, Fab', 
F(ab') 2 or other antigen-binding subsequences of antibodies) which contain minimal 
sequence derived from non-human immunoglobulin. Humanized antibodies are derived 
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from human immunoglobulins in which the residues forming the complementary 
determining regions (CDRs) are replaced by the residues from CDRs of a non-human 
antibody, such as a mouse, rat or rabbit antibody having the desired specificity, affinity and 
capacity. In some instances, Fv framework residues of the human immunoglobulin are 
replaced by corresponding non-human residues. Humanized antibodies may also comprise 
residues which are found neither in the recipient antibody nor in the imported CDR or 
framework sequences. The humanized antibody can comprise at least one or two variable 
domains, in which all or substantially all of the CDR regions correspond to those of a non- 
human immunoglobulin and all or substantially all of the constant regions are those of a 
human immunoglobulin consensus sequence. The humanized antibody preferably 
comprises at least a portion of an immunoglobulin constant region (Fc) of a human 
immunoglobulin. 

[0139] Humanized antibodies can be produced using transgenic mice which are 
incapable of expressing endogenous immunoglobulin heavy and light chains but can express 
human heavy and light chains. The transgenic mice are immunized in the normal fashion 
with a selected antigen. Monoclonal antibodies directed against the antigen can be obtained 
using conventional hybridoma technology. The human immunoglobulin transgenes 
harbored in the transgenic mice rearrange during B cell differentiation, and subsequently 
undergo class switching and somatic mutation. Using this technique, therapeutically useful 
IgG, IgA and IgE antibodies can be prepared. 

[0140] In addition, humanized antibodies which recognize a selected epitope can be 
generated using a technique referred to as "guided selection." In this approach a selected 
non-human monoclonal antibody, e.g., a murine antibody, is used to guide the selection of a 
humanized antibody recognizing the same epitope. 

[0141] In a preferred embodiment, the antibodies to CLPP1 are capable of reducing or 
eliminating the biological function of CLPP1. Preferably, the antibodies reduce at least 
25% of CLPP1 activity. More preferably, the antibodies reduce at least about 50% of the 
activity. Highly preferably, the antibodies reduce about 95-100% of CLPP1 activity. 
[0142] Anti-CLPPl antibodies can be used to isolate CLPP1 . Suitable methods include 
affinity chromatography and immunoprecipitation. Moreover, anti-CLPPl antibodies can 
be used to evaluate the expression level of CLPP1. Anti-CLPPl antibodies can also be used 
to monitor CLPP1 level as part of a clinical testing procedure, or to determine the efficacy 
of a given treatment regimen. Detection can be facilitated by coupling the antibody to a 
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detectable substance. Examples of detectable substances include various enzymes, 
prosthetic groups, fluorescent materials, luminescent materials, bioluminescent materials, 
and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, galactosidase, or acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and avidin/biotin; examples of suitable 
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an 
example of a luminescent material includes luminol; examples of bioluminescent materials 
include luciferase, luciferin, and aequorin; and examples of suitable radioactive materials 
include 125 I, ,31 I, 35 S or 3 H. 

[0143] Anti-CLPP 1 antibodies are also useful for targeting a therapeutic agent/drug to 
a particular cell or tissue. The therapeutic agent/drug may be coupled to an antibody, either 
covalently or non-covalently. For instance, a therapeutic agent can be coupled to an 
antibody via a linker group. A linker group can function as a spacer to separate the 
antibody from the agent so as to avoid interference with antibody's binding capabilities. 
The linker group can also serve to increase the chemical reactivity of a substituent on the 
agent or the antibody, and thus increase the coupling efficiency. A variety of Afunctional 
or polyfunctional reagents, either homo- or hetero- functional (such as those described in the 
catalog of the Pierce Chemical Co., Rockford, 111.), can be employed as the linker group. 
Coupling may be effected, for example, through amino groups, carboxyl groups, sulfhydryl 
groups or oxidized carbohydrate residues. There are numerous references describing this 
methodology. See, e.g., U.S. Pat. No. 4,67 1 ,958. 

[0144] Where a therapeutic agent is more potent when free from the antibody, it may 
be desirable to use a linker group which is cleavable during or upon internalization into the 
target cell. A number of different cleavable linker groups have been described. The 
mechanisms for the intracellular release of an agent from these linker groups include 
cleavage by reduction of a disulfide bond (e.g., U.S. Pat. No. 4,489,710), by irradiation of a 
photolabile bond (e.g., U.S. Pat. No. 4,625,014), by hydrolysis of derivatized amino acid 
side chains (e.g., U.S. Pat. No. 4,638,045), by serum complement-mediated hydrolysis (e.g., 
U.S. Pat. No. 4,671,958), or by acid-catalyzed hydrolysis (e.g., U.S. Pat. No. 4,569,789). 
[0145] It may also be desirable to couple more than one agent to an antibody. In one 
embodiment, multiple agents are coupled to one antibody molecule. In another 
embodiment, at least two different types of agents are coupled to one antibody. Regardless 



NVA279520 



38 



of the particular embodiment, immunoconjugates coupled with more than one agent can be 
prepared in a variety of ways, as appreciated by one of ordinary skill in the art. 

Vectors. Expression Vectors and G ene Delivery Vectors 

[0146] Another aspect of the invention pertains to vectors containing a polynucleotide 
encoding CLPP1 or a portion thereof. One type of vector is a "plasmid," which includes a 
circular double stranded DNA into which additional DNA segments can be introduced. 
Vectors also include expression vectors and gene delivery vectors. 
[0147] The expression vectors of the present invention comprise a polynucleotide 
encoding CLPP1 or a portion thereof. The expression vectors also include one or more 
regulatory sequences operably linked to the polynucleotide being expressed. These 
regulatory sequences are selected based on the type of host cells. It will be appreciated by 
those skilled in the art that the design of the expression vector depends on such factors as 
the choice of the host cells and the desired expression levels. CLPP1 can be expressed in 
bacterial cells such as E. coli, insect cells (using baculovirus expression vectors), yeast cells 
or mammalian cells. The expression vector can also be transcribed and translated in vitro, 
for example, by using T7 promoter regulatory sequences and T7 polymerase. 
[0148] Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of the recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification of 
the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Suitable cleavage 
enzymes include Factor Xa, thrombin and enterokinase. Examples of fusion expression 
vectors include pGEX (Pharmacia Piscataway, NJ), pMAL (New England Biolabs, Beverly, 
MA) and pRITS (Pharmacia, Piscataway, NJ). Purified fusion proteins can be utilized in 
CLPP1 activity assays, or to generate antibodies specific for CLPP1. 
[0149] Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc and pET 1 Id. Target gene expression from the pTrc vector relies on host RNA 
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po.yn.smse transcription from a hybrid trp-.ac fusion promote,. Targe, gene expresston 
from me pET . Id vector re.ies on transcription from a T7 gniO-lac fttsion promoter 
mediated by a co-expressed vira. PNA polymerase (T7 gn,). This viral polymerase ,s 
supplied by host strains BL21(DE3) or HSLE174(DE3) from a resident prophage harbonng 
a T7 gnl gene under the transcriptional control of the lacUV 5 promoter. 
t01 50] One strategy to maximize recombinant protein expression m E. coh ts to 
express the protein in host bacteria that have an impaired capacity to proteo.ytica.ly cleave 
fce recombinant protein. Another strategy is to a.ter the po.ynuc.eotide sentence encodmg 
L protein so that the individua. codons for each atnino acid are those preferenttally uU.tzed 



in E. coli. . 
[0,511 in another embodiment, the CLPP. expression vector is a yeas, «*»"» 
vector Examp.es of yeas, expresston vectors mch.de pVepSec, , pMFa, plRYSS P YES2 
(Invifrogen Corporation, San Diego, CA), and picZ (Invitrogen Cotp, San Dtego, CA). 
[0152] Alternatively, CLPP. or its variant can be expressed in insect cel.s usmg 
aculovirus expression vectors. Suitable baculovirus vectors inc.ude the pAc series and the 



pVL senes. r 

u jimpnt CI PP1 or its variant is expressed in mammalian 
[01531 In yet another embodiment, Cl^r i or us> van r 

cells ustng a mammahan expression vector. Examp.es of mammahan expression vectors 
include pCDMS and pMT2PC. When used in mammahan ceHs, tite expresston vector 
eontro, ft.nc.ions are often provided by vira, re g u,a.„ry e.emen., For — g «* 
used promos are derived from polyoma, adenovirus 2, cytontega.ovtrus and S,m.an V,rus 

40. 

. 01 54] in anotiter embodimen., .he mammalian expression vector contatns t,ssue- 
S peeif,c regu,a.ory elements. Exantp.es of suhable tissue-specifc p— mc^he 
Hver-specifc albumin promoter, .ymphoid-specif.c promote., promoters of T cell tecep.ors 
and immunog.obu.ms, neuron-specific promoters (e.g., the neurofi.amen. promoter), 
pancreas-specific promoters, and mammary g.and-specific promoters (eg., m.Ht whey 
promoter). Developmental.y-regulated promoters are also co„.emp,a,ed, wmeh tuc.ude, for 
example, the a-fetoprotein promoter. 

.01S51 The present invention also provides a recombinant expresston vector 
comprising a po.ynucleotide which encodes CLPP. but is e.oned into .he expression vector 
in an antisense orientation. Regulatory sequences <ha« am operative.y .inked to .he 
an.isense-orien.ed polynucleotide can be chosen to direct continuous expresston of the 
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antisense RNA molecule in a varied of cell types. Suable regulatory sequences mclude 
viral promoters and/or enhancers. Regulatory sequences can also be chosen .o three, 
constifutive, tissue specific or cel. type specific expression of.be antisense RNA. The 
antisense expression vector can be in .he form of a recombinan, plasmid, phagem.d, or 
attenuated virus in which an,ise„se po.ynucleo.ides are produced under the eontto. of a 
highly efficient regulatory region. 

[0156] The present invention further provides gene delivery vehic.es for dehvenng 
polynucleotides ,„ mamma,, A polynucleotide sequence of .he invention can be 
administered Cher locally or systemically via a gene delivery vehicle. Expression o the 
polynucleotide can be induced using endogenous mammahan or hetero.ogous promoters. 
Expression of the polynucleotide in vivo can be eiiher cons.itn.ed or regulate* The gene 
delivery vehicles preferably are viral vectors, including retroviral, lentiviral, adenovrral, 
adeno-associated viral (AAV), herpes viral, or alphavims vectors. The viral vecters con 
a,so be astrovirus, coronavirus, orthomyxovirus, papovavirus, paramyxovirus, parvovrus, 
picomavirns, poxvirus, or togavirus vectors. 

[01571 Delivery of gene therapy cons«rnc.s is no, limited ,o me above menttoned vrral 
vectors. Cher delivery methods can also be cmp.oyed. These methods indude nuclete ac.d 
expression vectors, polycattonic condensed DNA linked or unlinked ,o kilted adenovnus, 
Hgand .inked DNA, liposome-DNA conjugates, gene guns, ionizing radiatton, nuclete 
charge neuttalization, or fusion wnh eel, membmnes. Naked DNA can afso be employed. 
Uptake efficiency of me naked DNA may be improved using biodegradabfe latex bead., 
litis method can be former improved by treating me beads ,o increase meir hydrophob.ctty. 

u, „,lMMe F t"™'"" Svswms , 
[0158] Ano,her aspec, of the present invention pertains to the use of regulable 
expression systems ,o express desirable polynucleotides or polypeprides in ecus. Systems 
sniteble for this invention are briefly described below: 

[0,591 Te.-on/off system. The Tef-system is based on two regulatory elements dertved 
from me tettacycline-resistence operon of the E. coli Tn.O ttansposom me te, repressor 
protein (TetR) and ,he Te, operate, DNA sequence (,e.O) ,o which Te,R bmds (Gossan e, 
al Science268 : 1766-f769,f995). The system consists of two components, a regulator 
and a "reporter" plasmid. The "regulator" plasmid encodes a hybrid protein contenting a 
muteted Te, repressor (rte.R) fused ,o .he VP16 activation domain of herpes simplex v,rus. 
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The "reporter" plasmid contain, a .expensive Cement (TRE), wmch co»tio,s fire 
"reporter" gene of choice. The rtetR-VP.6 fusion protein can only bind to the TRE, 
therefore activating the transcription of Ore "reporter" gene in the presence 
The system has been incorporated into a number of vira! vector, including re— 
adenovirus and AAV. 

[0160] Ecdysone sys,e m . The ecdysone system is based on the molting mductron 
system found in DrosopHila, bu, modified for inducible expression in mammahan cells. 
The system uses an analog of fire DrosopMa steroid hormone ecdysone, munsterone 
activate expression o, the gene of interest via a heterodimeric nuclear receptor. Expression 
HI have been reported to exceed 200-fo,d over basa, ,eve,a with no effect on mammahan 
eel.physio 1 ogy(Noe,a..,Proc.Na. 1 .Acad.Sci.USA93 : 3346-3351,1996). 

cellpnys gyr nrottesterone receptor is normally stimulated to bmd 

[0161] Progesterone-system, ineprogesiciun^ f 

,„ a specific DNA sequence and <o activate transcription through an interaction w.ut .« 

the progesterone receptor rha, can be stimu.ated to bind through an tnteractton wr* RU486 
has been generated. To generate a specific, regu.atab.e transcription factor, fire RU486- 
hiding domain of the progesrerone recepror has been fnsed to the DNA-bindmg domam of 
the yeas, transcription factor GAM and the transaction domain of the HSV pro em 
VP,6 The chimeric factor is inactive in the absence of RU486. The addrfon of hormone, 
however, induces a eonformationa, change in the chimeric protein, and this change allows 
binding to a GAL4-binding site and fire activation of transcription ftom promoters 
containmg the GAL4-binding srte (Wang e, al„ Nat. Biotech 15: 239-243, 1997). 
Zm bnmunosuppressrve agents, such as FK506 and rapamycm, 

c, by binding to specific cellular proteins and faceting their dimerization. For examp e, 
act oy oi.iu. » . . ,„™, -..ui,. ;„ i K heterodtaenzation 

fite binding of rapamycin to FK506-bmdmg protem (FKBP) remits nr.ts 
with another rapamycin binding protein FRAP, which can be reversed by removal of the 
drug. The ability to bring two proteins together by addition of a drug po.en.ta.es the 
reg iation of a number of bio.ogica, processes, including transcription. A chtmene DN A- 

a specific DNA-binding sequence. A transcription* activation domam a.so has been fused 
to FRAP. When .hese two fusion proteins are co-expressed in the same ce.l, a fully 
functional transcription factor can be formed by heterodimerization mediated by addmon of 
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rapamycin The dimerized chimeric transcription factor can then bind to a synthetic 

L been success*.,,, integrated ,n«o adenovira, and AAV vectors. Long .emr regu ateb.e 
g e„e expression has been achieved in boo, mice and baboons (Ye e. a,., Scenee 283. 88-91, 



1999). 



pelection Methods 

[0163] in patients with disorders re!a,ed .0 the aberrant expression of CLPP1. The , 
expression level of CLPP1 can be used as an indicator for detecting the presence o CLPPl- 
rel a,ed diseases in humans. Detection and measurement of me relative amount of the 
CLPP1 gene product can be carried out using various methods known m the art. 
,01641 Typical methodo.ogies for detecting the transcription level of a gene mclude 

RKA or derivative thereof (such as cDNA or cRUA), and de.ecting me probe. Sunabl. 
maWs inchrde Norihent B,o. and mutative RCR or RT-PCR. fa «. hybndrzation can 
a,so be used .0 detect me Option level of theCLPPl gene in human tissues. 
.0165] Typical methodo.ogies for detecting a polypeptide include extracting protems 

antibody. Sui.ab!e methods include enzyme .inked immunosorbent assays (ELISAs), 
Western blots, rmmunoprecipitations, and immunofluorescence. The antibody can be 
polyclonal, or preferab.y, monoclonal. The antibody can be an intact antibody or a 
Unen, thereof (e.g. Fab or F(ab-)a ). The antibody can be !abe,ed with a raAo^a 
fluorescent compound, an enzyme, an enzyme co-factor, or a <^^™~* 
..,abe,ed,» with regard ,o a probe or antibody, is intended to encompass drrec, labehng such 
as ouou^t covalen, couphng, as well as indirect labehng such as being medrated by a„o.her 
Z* which is directly ,abe,ed. Examp.es of indirect .abe.ing inc.ude .abe.tng a pnmary 

oiotin which can be detected, for examp.e, by a fluorescence-^ streptavrdur 
[0,66, Preferably, the binding affinity of .he antibody .o CLPP. is a. leas. 
More preferabry, .he binding affinity is a, leas. Iff M". Other methods such as 
e.ectrophoresis, chromatography or direct syncing can a.so be use. .0 de.ec. tire amount 
of a po.ypep.ide in a biological sample. Anti-CLPP. antibodies can a.so be dtrectiy 
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tatro duced into a subject. The antibody can be labeled w«h a radioachve marker who. 
.Lence and location in the subject can be de.ec.eo using standard 
"Zn >" one embodiment, .be genomic copies of the CLPPI gene m *. genome of a 
Lan subject may indicate the presence or predisposition of a disease. DeteCon of the 

lethods known in Ute art. For tnstance, i, can be assessed u,ng Southern Bo, Theses 
for Southern B,o, can be labeled with a radioisotope, a fluorescent compound, an enzyme, 

2 I determine the seveHty of CLPP,-re,a,ed disease, A bio.ogica. sample , ,so,a«ed 

re ,ative to a norma, or con.ro> samp.e is evahtated. The expresston eve of CLP « * 
bio.ogica, samp.e can indicate me presence or severity of CLPP,-re,a,ed d,sease ,n the 
^ 1 "hioiogica. samp.e" is intended .o inciude tissues, cefls or b.o.og, a, 
fluids is o,a,ed from flte subject. A preferred bio,og,ca, S amp,e is a serom samp.e .solated 
from the subject using conventional means. 

ST'ES— a.aoprovfdeamefltodaforiden.ifyfngCLPP.modu^. 

peptides, pep.idomime.ies, pep.oids, po,ynuc,eo,ides, smafl mo,ecu,es or o*er drugs. 
These moieties - either bind to CLPP 1, or have a modu.arory (eg. , sbmmatory or 
inbibitmy) effect on me activity of CLPPI . In one embodiment flte motenes have a 

These moieties can a,so exeri a modu,a,ory effect on .he ^^^^ 
.says of .he present invention comprise detecting the interacts between CLPP. and 

;;;r nlS The.es,compounds of thepresen, invention can be e.ther smafl mo.ecn.es or 
inactive agents, .n a preferred embodiment, the tea, compound is a smafl 
taor ganic mo,eeu,e. .n another preferred embodiment, the test compound „ a po.ypepfdes, 
oligopeptides, polysaccharides, nuc.eo.ides or polynucleotides. 
[Cm, ,n accordance with one aspect of una invention, methods for screemng for 
compounds ,ba, inhibit me biological activities of CLPPI are prov.ded. P_.,ca, 
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substantially decreases the expression level or aettvtttes of CLPP1 . The sere 

» be carried ou, either in vivo or for ^ for ^pounds 

r0172l The present invention further includes a mci 

Jterm "binding partner" refers ,o a hioaetive agent which serves as etUter a snbshate 
CL Toraligan having a binding afftnity to CLPP1. The bioacrive agen, may be 

polysaccharides, nucleotides or polynncleottdes. maybeuS edas 
TOTS, Inhibitors of the express™, active or bindrng abthty of CLPP1 may be us 
Seuric conrposlrions. These inhrbitors cao be — ed in suitable pha— cal 

rprovideamerhodsforconducringhigh^ughpn. 
c ed ing for compounds capable of inhib.ting activity or expression of CLPP1. .n on 

with CLPP1, and men de.ee.ing the effee, of .he ,es. compounds on CLPP1. FuncUo 

FLIPR® Molecular Devices Corp, Sunnyvale, CA), or the TUNEL assay, ca P 
o llare CLPP1 ce.lu.ar acrivhy. Fluorescence-based technics can be used for htgh- 
to measure ^rr ed 
throughput and ultra high-throughput screening. They include, but a 
BRET® and FRET® (both by Packard Instrument Co., Menden, CT). 

rJnsurfaclp,moo»avesare 
jdin.eribce. B, -cring d.rec.ed ligh. frirm me su^ = U 
of eonrae, with a sample, me surface plasmon resonance causes a changetn^he reftactiv 
• , ,be surface .ayer The refractive index change for a g,ven change of mass 

InThpids and p„,ynuc,eo.ides), and since ,he B.ACORE® sensor surface can be 
test compounds can thus be accomplished. 
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[0X76! Using the CLPP1 detection methods of Otis invention, the efficacy of 
therapeutic agent for CLPP.-re.ated diseases can be monitored dnring ctintca. mals. The 
rherapeutic agent may he a drug, sma„ molecule, agonist, antagonist, pep— e «c 
protein peptide, or polynucleotide. The changes in the expression or active of tine CLPP 

the agen on patients with CLPP.-related disease, In addition, me exp—r achvtty 
of CLP 1 in response to the agen, can be measured a. various points during me cluneal 



,0,77, in a pteferred embodiment, me method for monitoring the effectiveness of the 
therapeutic agen. includes the steps of (i) obtaining a pre-admin.stration sample from a 

sample; (iii) obtaining one or more post-administration samples from the subject; (tv) 

de«e!,i the ,eve, of expression or activity of CLPP, in the post-administratton samples, 

detecting me r . . „,„ f riPPl in the nre-administration sample 

(v) comparing the level of expression or activty of CLPP1 m the pre 

,„ the .eve, of expression or activity of CLPP, in the post administration samples. The 

dose or frequency of the administration of the agent may be adjusted based on the 

ooseoriren Th . ra f or . CLPP1 expression or activity 

effectiveness of the agen, in a particular patient. Therefore, CL P 

can be used as an indicator of the effectiveness of a therapeutic agen. for CLPP1 re,a,ed 

diseases, even if ,he agen, does no, produce an observable phenotypic response. 

or a. risk of developing CLPP.-re.a.ed diseases. In addition, me de.ec.ion methods canbe 
used to determine wheurer an agen, (,g., an agonis,, antagonist, peptidomimebc prolan 
peptide, polynucleotide, small molecule, or other drug candidate) can be admtmstered 
subject for effectively treating or preventing CLPP.-rela.ed diseases. 
,0,79, CLPP1 expression profiles a. different progression stages of CLPP 1 -related 
diseases can be establish*. In addition, CLPP. expression profiles in different patient. 
who have different responses to a d„g treatment are determine.. A *"^Z 
such tha, a particular expression profile may be corre.ated to an tncreased hke. hood of a 
poor prognosis. Therefore, the prognostic assay of the present invention may be used to 
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detemtine whether a subject undergoing a treatment for a C LPP,-re.a.ed disease has a p~r 
ou,.ook for long term survival or disease progression. Preferably, prognosis .a performed 
shortly after diagnosis, sueh as within a few days after diagnosis. The result of prognos, 
can then he used to devise individualized treatment program, .hereby enhancmg the 
effectiveness of the treatment as well as the likelihood of long-term survival and well being. 
mm The method of the invention can also be used to de.ec. genetic alterations m the 
CLPP1 gene, thereby determining if a subject with the altered gene is a, risk for damages 
characterized by aberrant regulation in CLPP1 activity or expression. In a preferred 
embodiment, the method includes detecting the presence or absence of a genettc aheratton 
,ha, affects the integrity of the CLPP1 gene, or detectingthe aberrant expression of tire ^ 
CLPP1 gene The genetic alteration can be detected by ascertaining the existence of a. leaa, 
„„e of the following: 1) deletion of one or more nucleotides from the CLPP1 gene; 2) 
addition of one or more nucleotides to the CLPP. gene; 3) substitution of one or more 
nucleotides of the CLPP1 gene, 4) a chromosomal rearrangement in the CLPP1 gene; 5) 
aheration in the level of a messenger RNA transcript of the CLPP. gene, 6) aberrant 
unification of the CLPP. gene, 7) the presence of a non-wild type splicing pattern of a 
lessenger RNA — of the CLPP. gene, 8> non-wild type level ™ ^0 - 
of a CLPP1 gene, and 10) inappropriate post-translational modificatton of CLPP1. 
,01811 In one embodiment, detection of the alteration involves the use of a 
probe/primer in a polymerase chain reaction (such as anchor PCR or RACE PCR) or 
alternatively, in a ligation Cam reaction (LCR). LCR can be particularly 
detecting point mutations in the CLPP1 gene. This memod includes the steps of collating 
sample from a subject, isolating polynucleotides (e.g., genomic UNA, mRNA, or bo*) from 
the sample, contacting the polynucleotide with one or more primers which spectfically 
hybridize to tire CLPP1 gene or gene product, and detecting the presence or absence of an 
amplification product, o, detecting the size of the amplification prcAic, and companng tts 
tength to a control. It is understood that PCR and/or LCR can be used as a preliminary 
amplification step in conjunction with any outer techniques described herein. 
[0182] Alternative amplification methods include: self sustained sequence replication 
(Guatelli e, a,., Proc. Nat,. Acad. Sei. USA 87:1874-1878, 19,0), ttanscrip tiona. 
amplification system (Kwoh e, a,., Proc. M Acad. Sci. USA 86:1 173-1 .77, ,989), and 
Q-Beta Replicase (Lizardi, a at, Bio-Technology 6:1 197, 1988). 
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„ ,„ another embodiment, mutations in the CLPP1 gene can be identified using 

Lion enzymes. Differences in res.nc.ion enzyme digestion patterns ind.ca.es 
restncfo nzym 

mutation(s) in the CLPP 1 gene or iu> v ... p t N 

can be nsed to detect the presence of specific mutations. See, for example, U.S. No. 

m y« another emhodrment, genetic motions in the CLPP. gene can be 

Tween the two sequences. This step afiows dte identificafion of point mutahon, Th,s 
1 T „ iowed by a second hybridization array mat afiows fite charaCerizaUon of spe file 
2^1 by using smafier and specialized probe arrays which are commentary to a„ 
:22 l««L deteced. --u,fio» array is compos, of^— one 
commentary to the wfid-type gene and the other complementary ,0 *~ „ 

rOWSl In s.i.1 another embodiment, any sequencing reacrons known m the art can b 
Zl direcfiy sequence the CLPP, gene in order to derec, mutations. It ,s contemned 
^auromatedaeouencingp— canbenfifizea.inc.udingse^encmgbymaaa 

mm"^ one embodiment, protection from cleavage agents is used to detect 

(-labeled, containing the wild-type CUT i gene s»c 4 r 
rlinefi fromafissue sample. - oouble-strandcd duplex are rreated^ 
ag e„, which cleaves single-strandec regions of the duplex. The agent may be KN (for 
WA/DNA duplexes), or S, nuclease (for DNA/DNA hybrids). In one case erther 

pipcridine and osmium tetroxide, in order ,0 diges. mismatched regtons. After me 

ftomwhichthesite(s)ofmutationmaybedeterm,ned. ,„ v , oneoI 
[01 87, In a preferred embodiment, the mismatch cleavage reacon employs one or 

these proteins include "DNA mismatch repatr" enzyme, For ms^ce, the mu,Y enzyme 
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cleaves T at G/T mismatches. In one case, cDNAs are prep 

« ce„s. The cDNAs are ,hen hybridiaed ,0 a prohe denveti from rhe CLPP ^ T 
prodacts, if any, can he detected from eiectrophoresis protocols or me I,ke. Sae 



"I'! remhodtmen,, — in eiecirophoretic mohiiiry are nac^ ,o 
Un.tionaintheCLPP, g ene. Differences in electrophoretic mob.lt,, 

, hism (SSCP) The resulting alteration in eleetrophorettc mobtl.ty enables the 

'IZed asin denaburing gradten, g e, electrophoresis (D CCE, For tins pwpo^A 

GC elamp of approximate., 40 OC-rich base pans can be added to he DN gm 
PCP. In a farther embodiment, a temperatare gradient is ascd m p.ace of a denatanng 

gradient (Rosenbaam and Reissner Biophys, Chem 265:12753,1987). 

Examplesofomer.echmqnesforde.ectingpoin.mn^onsmcladebn.areno, 

IT! to oHsonnCeotide hybridization, selective amplification or selective 

"^e— in o e embodiment, oligonucleotide primers for stifle amplication 
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sy mp,oms or family history of a CLPP.-rCa.ed disease. Any ceH type or .issue in which 
CLPP1 is expressed can be used for prognostic or diagnosUc purposes. 

Prophylactic Methods . 

. - ■ f nr nrevenring diseases associated with 

T01921 This invention also provides methods for preventing 

- xk. m .ihndi comnrise administering to a target 
aberrant CLPP1 expression or activity. The methods compns 

subject an agent which modulates CLPP1 expression or activity. 

2U Subjects a, risk of diseases which are caused by or attributed to aberrant CLPP > 

related disease symptoms in order to prevent or delay CLPP1 rela 

prophylactic agents include mutan, CLPP1 proteins, CLPP1 antagonist agents, or CLPP1 

^^^^^^^^ 
modified, based on knowledge obtained from the study of pharmaoogenom.es. 
Pharmacogenomics includes .he application of genomics technologies, such as gene 
eq ue„ci„ , statistic, genetics, and gene expression analysis, to drugs - 
deal developmen. or on me marxe.. pharmacogenomics can be used «o de ermme 

lotype"). Thus, ano.her aspect of this invention is to provide methods for tadonng - 
Zl, s prophy,ac,ic or .herapentjc treatmen, using CLPP. modu,a.ors according m*e 

rarge, prophylactic or therapeutic treatments .o subjects who will most enefi, fnom *e 
rrlL and to avoid treatment of subject who will experience .oxic drng-related ,de 

^ One pharmacogenomics approach .0 identify genes that predict drug response, 
Low! as "a genome-wide association," reHes primarily on a high-resolution map of me 
Tman genome consisting of aheady known gene-re,a.ed sites (e.g., a »b, a„ehc ^ 
marker map which consists of 60.000-.00.000 polymorphic or variable sites on .he human 
ge „ome, each of which has .wo variants). Such a ruga-resolution genetic map can * 
ompared to a map of the genome of each of a statistics,,, substantia, ^«<*£. 
raking part in a Phase .Ml. drug trial in order to identify genes associated with a p— 
obseL drag response or side effect. Alternatively, such a high resolution map can be 
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generated from a combination of some ten-million known single nucleotide polymorphs 
(SNPs) in the human genome. A «SNP" is a common alteration that occurs in a smgle 
nucleotide base in a stretch of DNA. For example, a SNP may occur once per every 1000 
bases of DNA. A SNP may be involved in a disease process. However, the vast majonry of 
SNPs may be not related to diseases. Given a genetic map based on the occurence of 
SNPs, individuals can be grouped into generic categories depending on a particular pattern 
of SNPs in their individual genome. In such a manner, treatment regimens can be tatlored 
,0 groups of genetically simi.ar individuals, raking into account traits that may b. enmmon 
antong such genCicahy similar individuals. Thus, mapping of the CLPP1 gene to SNP 
maps of patients with CLPPl-re.a.ed diseases may facilitate the identification of drug- 
response-prediction genes. 

.0196) Alternatively, the "candidate gene approach" can be utilized to identtfy genes 
,ha, predict drng response. According to this method, if a gene ma, encodes a dmg target .s 
known, all common variants of that gene can be easily iden.if.ed in the population. It .hen 
can be detemrined if a particular dmg response is associated win. one version of me gene 
versus another. 

[0197] The acttviry of drug metabolizing enzymes is a major detemnnant of bom the 
intensity and duration of dmg action. The discovery of generic polymorphisms of dmg 
metabolizing enzymes (,g„ N-acCylttansferase 2 (NAT 2) and cytochrome P450 enzymes 
CYP2D6 and CYPZC19) has provided an explanation as .o why some subjects do not 
obtain .he expeCed dmg effec«s or show exaggera.ed dmg response and serious .ox.c..y 
after taking me standard and safe dose of a dmg. These polymorphisms am expressed m 
,wo phenotypes in the population, extensive metabolize, and poor metabohzer. The 
prevalence of poor me.abo.izer phenotypes is differen. among different populations. For 
example, me gene coding for CYP2D6 is highly po.ymorphic and several 
been iden.if.ed in poor merabolizers, which all lead «o the absence of functional CYP2D6. 
Poor metabolizes of CYP2D6 and CYP2C19 quite frequently experience exaggerated dmg 
response and side effects when they receive standard doses. If a metabolite i, the active 
therapeutic moiety, poor metabolize, show no therapeutic response. The other extreme are 
the so called ultra-rapid metabolizers who do no. respond to stimdard dose, Recently, me 
mo.ecu.ar basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 
amplification. 
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[0198] In one embodiment, the "gene expression profiling" method can be utilized to 
identify genes that predict dmg response. In this regard, me gene expression profile of an 
anima. dosed with a drug can give an indication of whether the gene pathways related to 
toxicity have been turned on. 

[0199] Information generated from the above phamtacogenomics approaches can be 
used to determine the appropriate dosage or treatment regfmen sutoble for a parheular 
individual. This knowledge car, avoid adverse reactions or therapeutic failure, and therefore 
enhance therapeutic or prophylactic efficiency when treating a subject wtth an CLPP1 
modulator. 

Thera peutic Methods 

[0200] As described above, the present invention includes therapeutic methods for 
treating a subject at risk for, susceptible to, or diagnosed with CLPPl-related diseases. The 
therapeutic methods can be individually tailored based on the subject's drug response 
genotype. Typically, the therapeutic methods comprise modulating the expression or 
activity of CLPP1 in the subject. In one embodiment, the method comprises contacting a 
plurality of cells in the subject with an agent that inhibits the expression or activity of 
CLPP1 Suitable agents include polynucleotides (e.g., an antisense oligonucleotides of 
CLPP1), polypeptides (e.g., a dominant negative mutant of CLPP1), or polysaccharides, 
naturally-occurring target molecules of CLPP1 protein an CLPP1 protein substrate or 
receptor), anti-CLPPl antibodies, CLPP1 antagonists, or other small organic and inorganic 
molecule. They may also include vectors comprising polynucleotides encoding CLPP1 
inhibitors or antisense sequences. Moreover, the agents can be anti-CLPPl antibodies 
eonjugated with therapeutic moieties. Suitable agents can be identified using the screening 
assays of the present invention. 

punrmarputical Compositions 

[0201] The present invention is further directed to pharmaceutical compositions 
comprising an CLPP1 modulator and a pharmaceutically acceptable carrier. As used herein, 
a "pharmaceutically acceptable carrier" is intended to include any and all solvents, 
solubilizers, fillers, stabilizers, binders, absorbents, bases, buffering agents, lubricants, 
controlled release vehicles, diluents, emulsifying agents, humectants, lubricants, dispersion 
media, coatings, antibacterial or antifungal agents, isotonic and absorption delaying agents, 



NVA279520 



52 



and the like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutical* active substances is well-known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof in 
the compositions is contemplated. Supplementary agents can also be incorporated into the 
compositions. 

[0202] A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal 
(topical), transmucosal, and rectal administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application can include the following components: 
a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, 

glycerine; propylene glycol or other synthetic solvents; antibacterial agents such as benzyl 

alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfate; 

chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or 

phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose. 

pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. 

The parenteral preparation can be enclosed in ampoules, disposable syringes or multiple 

dose vials made of glass or plastic. 

[0203] Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersion. For intravenous administration, suitable carriers include 
physiological saline, bacteriostatic water, Cremophor EL- (BASF, Parsippany, NJ) or 
phosphate buffered saline (PBS). In all cases, the injectable composition should be stenle 
and should be fluid to the extent that easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The carrier can be a solvent or 
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures 
thereof The proper fluidity can be maintained, for example, by the use of a coating such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and by the 
use of surfactants. Prevention of the action of microorganisms can be achieved by vanous 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic 
acid thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
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for example, sugars, polyalcohols such as manitol, sorbitol, sodium chloride m the 
composition. Prolonged absorption of the injectable compositions can be brought about by 
including in the composition an agent which delays absorption, for example, aluminum 
monostearate and gelatin. 

[0204] Sterile injectable solutions can be prepared by incorporating the active 
modulator {e.g., an anti-CLPPl antibody, an CLPP1 activity inhibitor, or a gene therapy 
vector expressing antisense nucleotide to CLPP1) in the required amount in an appropriate 
solvent followed by filtered sterilization. Generally, dispersions are prepared by 
incorporating the active compound into a sterile vehicle which contains a basic dispersion 
medium and the required other ingredients from those enumerated above. In the case of 
sterile powders for the preparation of sterile injectable solutions, the preferred methods of 
preparation are vacuum drying and freeze-drying which yields a powder of the active, 
ingredient plus any additional desired ingredient from a previously sterile-filtered solution 
thereof. 

[0205] Oral compositions generally include an inert diluent or an edible earner. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutical* compatible 
binding agents, and/or adjuvant materials can be included as part of the composition. The 
tablets, pills, capsules, troches and the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth 
or gelatin; an excipient such as starch or lactose, a disintegrating agent such as algmic add, 
Primogel, or corn starch; a lubricant such as magnesium stearate or Stertes; a glidant such as 
colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring 
agent such as peppermint, methyl salicylate, or orange flavoring. 

[0206] For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 
eg a gas such as carbon dioxide, or a nebulizer. 

[0207] Systemic administration can also be by transmural or transdermal means. For 
transmural or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, and 
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include, for example, for — osal administia.ion, detergents, bite sa ltS , and fustdte aetd 
derivative, Transmural administration ean be accomplished through the use of nasal 
sprays or suppositories. For transdermal administration, the bioactive compounds am 
formulated into ointments, salves, gels, or creams as generally known ,n the art. 
10208] The compounds ean also be prepared in the form of suppositories (e.g., wtth 
conventional suppository bases such as cocoa butler and other glyeerides) or retention 
enemas for rectal delivery. 

[0209] In one embodiment, the therapeutic moieties, which may contain a btoacttve 
compound, are prepared with carriers that will prolec, the compound against raptd 
ehmination from the body, such as a control.ed release formu.ation, including implants and 
noncapsulated delivery systems. Biodegradable, biocompatible polymers ean be used, 
such as ethylene vinyl acetate, polyanhydndes, polyglycolic acid, collagen, polyorthoesters, 
and po.ylac.ic acid. Methods for preparation of such formulations will be apparent to those 
skilled in me art. The materials can also be obtained commercially from e.g. Alza 
Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes 
targeted to infected cells with monoclonal antibodies to viral antigens) ean also be used as 
pharmaceutic* acceptable carriers. These can be prepared according to methods kno™ 
,o those skilled in .he art, for example, as described in U.S. Paten. No. 4,522,81 1. 
10210] I. is especially advantageous to formulate otal or parenteral compostttons m 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as 
used herein includes physically discrete units suited as unitary dosages for the subject to be 
treated; each unit containing a predetermined quantity of active compound calculated to 
produce me desired therapeutic effect in association with .he required pharmaceutical 
carrier The specification for .he dosage unit forms of me invention are dictated by and 
directly dependent on tire unique characteristics of .he active compound and the particular 
therapeutic effec. to be achieved, and .he limitations inherent in the art of compoundtng 
such an active compound for the treatment of individuals. 

10211] Toxicity and therapeutic efficacy of such compounds can be determ.ned by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
thempcu.ica.ly effective in 50% of the population). The dose ratio berween 
therapeutic effects is .he merapeutic index and it can be expressed as the ratio LD50/ED50. 
Compounds which exhibit large therapeutic indices are preferred. While compounds that 
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exhibit .oxic side effects may be used, eare should be taken to design a defivery system ma, 
targets such compounds ,0 me site of affected tissue in order to minimi* po.en.tal damage 
to uninfected cells and, thereby, reduce side effects. 

,02121 The data obtained from the cell culture assays and animal sutdres can be used m 
formulating a range of dosage for use in humans. The dosage of such 
preferably within a range of circulating concensus ,ha, include me ED50 wuh htile or 
no .oxicity. The dosage may vary wimrn mis range depending upon me dosage form 
enrployed and .he rou.e of adminisfratlon utilizer.. For any compound nsed ,n me method of 
.he invention, .he therapeutically effective dose can be estimated initially from ce,l culture 
assays A dose may be formulaled in animal models to achieve a circulating plasma 
concen.ra.ion range tha, includes me IC50 (i.e., .he concenhation of the <»< 
which achieves a half-maximal inhibition of symptoms) as determined m cel. culture Such 
information can be used to more accurately determine useful doses in humans. Uve.s m 
p,asma may be measured, for example, by high performance liquid chromatography. 
[02 13] The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

Kits 

io7l41 The invention also encompasses kits for detecting the presence of a CLPP1 gene 
roduc, in a biological samp.e. An example ki, comprises reagents for assessing express 
of CLPP1 at mRNA or protein level. Preferably, the reagenrs include an anUbody or 
fragmen. .hereof, wherein rhe antibody or fragnren. specifically binds .0 CLPP1. 
Oplrafiy, .he k„s may comprise a po.ynucleo.ide probe capable of specfically b.ndmg to 
a Lscrip. of tire CLPP1 gene. The ki. may also contain means for de.erm.mng the amount 
of CLPP 1 protein or mRNA in tire tea, sample, and/or means for comparing tire amoun, of 
CLPP1 protein or mRNA in tine ,es. sample <o a conrrol or srandard. The compound or 
agent can be packaged in a suitable conlainer. 

,02151 The invention further provides kirs for assessing tire sm.ab.hty of each of a 

phnality of compounds for inhibiting CLPPl-rela.ed diseases in cells or human subject 
Such ki, include a plurality of compounds .o be tesred, and a reagen. (such as an antibody 
specific .0 CLPP1 proteins, or a polynucleotide probe or primer capable of hybrnhzmg <o 
the CLPP1 gene) for assessing expression of CLPP1. 
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[0216] It should be understood that the above-described embodiments are given by 

way illustration, not limitation. Various changes and modifications within the spirit and 
scope of the present invention will become apparent to those skilled in the art from the 
present description. 

Host Cells 

[0217] Another aspect of the invention pertains to host cells into which a 
polynucleotide molecule of the invention is introduced, e.g., a CLPP1 gene or homolog 
thereof, within an expression vector, a gene delivery vector, or a polynucleotide molecule of 
the invention containing sequences which allow it to homologously recombine into a 
specific site of the host cell's genome. The terms "host cell" and "recombinant host cell- 
are used interchangeably herein. It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential progeny of such a cell. Because 
certain modifications may occur in succeeding generations due to either mutation or 
environmental influences, such progeny may not, in fact, be identical to the parent cell, but 
are still included within the scope of the term as used herein. 

[0218] A host cell can be any prokaryotic or eukaryotic cell. For example, a CLPP1 
gene can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian 
cells (such as Chinese hamster ovary cells (CHO), COS cells, Fischer 344 rat cells, 
HLA-B27 rat cells, HeLa cells, A549 cells, or 293 cells). Other suitable host cells are 
known to those skilled in the art. 

[0219] Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign polynucleotide (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DAKD-dextran-mediated 
transfection, lipofection, or electoporation. 

[0220] For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable flag (e.g., resistance to antibiotics) is generally 
introduced into the host cells along with the gene of interest. Preferred selectable flags 
include those which confer resistance to drugs, such as G418, hygromycin and 
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methorrexate. Po.ynudeotides encoding a se.ec.able flag can be introduced into a host ce 
by the same vector as that encoding CLPP1 or can be introduced by a separate vector. Cells 
stably rransfeeted with the introduced polynucleotide can be identified by drug selectton 
(„. cells that bave incorporated the selectable flag gene will survive, while the other cells 

die). n . 

[0221] A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 

culture, can be used to produce (i.e. , express) CLPP1 . Accordingly, the invention further 
providesmethodsforproducingCLPPl using the host cells of the invention. Inone 
embodiment, the method comprises culturing the host cell of invention (into winch a 
recombinant expression vector containing a CLPP1 gene has been introduced) in a smtable 
medium such that CLPP1 is produced. In another embodiment, the method further 
comprises isolating CLPP1 from the medium or the host cell. 

Trans genic and Knock out Animals 

[0222] The host cells ofthe invention can also be used to produce non-human 
transgenic animal, For example, in one embodim.nl, a host cell of fire invention .s a 
fertilized oooyte or an embryonic stem cell into which CLPP1 -coding sequences bave been 
introduced. Such host cells can then be used ,0 create non-human transgenic antmals m 
which exogenous sequences encoding CLPP1 have been introduced into their genome or 
homologous recombinant animals in which endogenous sequences encoding CLPP1 have 
been altered. Such animals are useful for studying the function and/or activity of CLPP1 
and for identifying and/or evaluating modulators of CLPP1 activity. As used heretn, a 
••transgenic animal" is a non-human animal, preferably a mammal, more preferably a rodent 
such as a ra, or mouse, in which one or more ofthe cells of the annua! inc.udes a transgene. 
Other examples of transgenic animals include non-human primates, sheep, dogs, cows, 
goats, chickens, amphibians, and the like. A rransgene is exogenous DNA whrch ■ 
integrated into the genome of a cel. from which a transgenic animal develops and whtch^ 
remains in the genome ofthe marure animal, thereby directing .he expression of an encoded 
gene product in one or more cell types or tissues ofthe transgenic animal. As used heretr u a 
••homologous recombinant animal" or "knockout animal" is a non-human animal, preferably 
a mamma,, more preferably a mouse, in which an endogenous CLPP1 gene has been ahered 
by homologous recombinafion between the endogenous gene and an exogenous DNA 
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molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to 
development of the animal. 

[0223] A transgenic animal of the invention can be created by introducing an CLPP1- 
encoding polynucleotide into the mate pronuclei of a fertilized oocyte, e.g., by 
microinjection or retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. Intronic sequences and polyadenylation signals can 
also be included in the transgene to increase the efficiency of expression of the transgene. 
A tissue-specific regulatory sequence(s) can be operably linked to a transgene to direct 
expression of CLPP1 to particular cells. Methods for generating transgenic animals via 
embryo manipulation and microinjection, particularly animals such as mice, have become 
conventional in the art. Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of a 
transgene of the invention in its genome and/or expression of mRNA corresponding to a 
gene of the invention in tissues or cells of the animals. A transgenic founder animal can 
then be used to breed additional animals carrying the transgene. Moreover, transgenic 
animals carrying a transgene encoding CLPP1 can further be bred to other transgenic 
animals carrying other transgenes. 

[0224] To create a homologous recombinant animal (knockout animal), a vector is 
prepared which contains at least a portion of a gene of the invention into which a deletion, 
addition or substitution has been introduced to thereby alter, e.g., functionally disrupt, the 
gene The gene can be a human gene, but more preferably, is a non-human homolog of a 
human gene of the invention (e.g., a homolog of the CLPP1 gene). For example, a mouse 
gene can be used to construct a homologous recombination polynucleotide molecule, e.g., a 
vector suitable for altering an endogenous gene of the invention in the mouse genome. In a 
preferred embodiment, the homologous recombination polynucleotide molecule is deigned 
such that, upon homologous recombination, the endogenous gene of the invention is 
functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a 
"knockout" vector). Alternatively, the homologous recombination polynucleotide molecule 
can be designed such that, upon homologous recombination, the endogenous gene is 
mutated or otherwise altered but still encodes functional protein (e.g., the upstream 
regulatory region can be altered to thereby alter the expression of the endogenous CLPP1 
gene) In the homologous recombination polynucleotide molecule, the altered portion of the 
gene of the invention is flanked at its 5' and 3' ends by additional polynucleotide sequence 
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of the gene of the invention to allow for homologous recombination to occur between the 
exogenous gene carried by the homologous recombination polynucleotide molecule and an 
endogenous gene in a cell, e.g., an embryonic stem cell. The additional flanking 
polynucleotide sequence is of sufficient length for successful homologous recombination 
with the endogenous gene. 

[0225] Typically, several kilobases of flanking DNA (both at the 5' and 3' ends) are 
included in the homologous recombination polynucleotide molecule. The homologous 
recombination polynucleotide molecule is introduced into embryonic stem cells by 
electroporation. The cells in which the introduced gene has homologously recombined with 
the endogenous gene are selected. The selected cells can then be injected into a blastocyst 
of an animal (e.g., a mouse) to form aggregation chimeras. A chimeric embryo can then be 
implanted into a suitable pseudopregnant female foster animal and the embryo brought to 
term Progeny harboring the homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal contain the homologously recombined 
DNA by germline transmission of the homologously recombined DNA. Methods for 
constructing homologous recombination polynucleotide molecules, e.g., vectors, or 
homologous recombinant animals are well known in the art. 

[0226] In another embodiment, transgenic non-human animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PL Another 
example of a recombinase system is the FLP recombinase system of Saccharomyces 
cerevisiae (see e.g., O'Gorman et al, Science 251:1351-1355, 1991). If a creAoxP 
recombinase system is used to regulate expression of the transgene, animals containing 
transgenes encoding both the Cre recombinase and a selected protein are required. Such 
animals can be provided through the construction of "double" transgenic animals, e.g.,by 
mating two transgenic animals, one containing a transgene encoding a selected protein and 
the other containing a transgene encoding a recombinase. 

[0227] Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut, I. et al, Nature 385:810-813, 
1997 and PCT International Publication Nos. WO97/07668 and WO97/07669. In brief, a 
cell e g , a somatic cell, from the transgenic animal can be isolated and induced to exit the 
growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through the use 
of electrical pulses, to an enucleated oocyte from an animal of the same species from which 
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ta ascent ce« is iso,a«ed. The reconstituted oocyte is «he„ cuhwed such <ha, ,« devefcps 
,„ morula or blastocyte and ,he„ transferred ,o pseudopregnan, female fosfcr ammaL The 

cell, e.g. , the somatic cell, is isolated. 
Examples 

Example 1: Identification ofCLPPl seauence in human genome database. 
[0228] The nucleic acid sequence of CLPP1 is obtained from a newly developed 
genomic prediction pipeline. Briefly, the X-ray crystal stnrchnes of the catarytic domarns 
of protein phosphatase were collected and allied rogeflrer according ,o the* — _ 
id Ly/siLari«e S . The alignment was convened into a "scoring matrix" w,ch earned *e 
sn.cn.ra. proflle of the phosphatase catalytic domains. This scoring matnx was men «d 
,o search the Ce,era Human Genome database for sequences tha, have phosphatase ca.a.yt.c 
domains. 

Example 2: BLAST analysis 

[0229] Sequence alignments between CLPP1 and other sequences ,n GenBank 
Lhasa were performed using the standard protein-protein BLAST(blastp), stanrfcrd 
n „c,eo„de-nuc,eo.ide BLAST(blastn), BLAST2 Sequences, and human genome BLAST 
programs that are available at NCBIs BLAST website. 

l„2^, A standard protein-protein BLAST search in the "nr" database (avarlable a. 
NCBl s BLAST website) with "Fflrer" sorting unchecked, "Expect" setting a. .0.0, "Word 
Size" setting a, 3, "Matrix" setting a, BLOSUM62, "Gap costs" setting a, Exlstenc « and 
Extension:, , revea,ed ft* CCPP. is identtca, to a hyporhetica, h ™^ 0 "^ 
(Entre z accession number: XM.00270), which was predicted from NCBI con«gNT024„5 
by automated computational ana.ysis using gene predretion meflrod GenomeScan. in 
addition, CLPP. a.so share sequence homologies to a number of proteins, tnc.ud.ng bu, are 
no, limited to. Bacteriophage lambda scr/thr protein phosphatase (Entrez accession number: 
P03772 75% ahgnmen, ,o amino acid residues 55 - 242 of CLPP,), E. co.i ser/thr pro*tn 
phosphate ( Entrez accession number: NC002695, 74% ahgnmen, ,o amino acid restdues 
55 - 242 of CLPP,), an unsown prorein encoded by prophage CP-933K (Enrrez accesston 
number: AE005256, 72% ahgnmen, to amino acid residues 102 - 242 of CLPP,), E. coh 
phosphoprotein phosphatase 1 (Entrez accession number: AP002559, 47% ahgnmen, ,o 
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amino acid residues 60 - 239 of CLPP1), and S. typhimurium protein phosphatase B (Entrez 
accession number: AY049951 , 42% alignment to amino acid residues 43 - 239 of CLPP1). 
[0231] A domain search was performed within the standard protein-protem BLAST 
search with the RPS-BLAST 2.2.3 [Apr-24-2002] program. The amino acid residues 47 to 
21 1 of CLPP1 are highly homologous to the consensus sequence of the calcineunn-hke 
phosphoesterase family (pfam00149). This family includes a diverse range of 
phosphoesterases, including protein phosphoserine phosphatases, nucleot.dases, 
sphingomyelin phosphodiesterases and T-V cAMP phosphodiesterases as well as nucleases 
such as bacterial SbcD or yeast MRE1 1 . The most conserved regions in this famny centre 
around the metal chelating residues. 

[0 232] A standard nucleotide-nucleotide BLAST search in the "pat" database (avadable 
at NCBI's BLAST website) with "Filter" setting unchecked, "Expect" setting at 10.0, "Word 
Size" setting at 3, identified significant alignment of CLPP1 nucleotide sequence to a 
fragment of Sequence 29 from PCT patent application WG99/27132 (Entrez accession 
number- A95298, 99% alignment to nucleotides 771-1326 of CLPP1), and a fragment of 

Sequence 14 from PCT patent application WO02/24924 ( Entrez accession number: 

AX406967 100% alignment to nucleotides 168 - 708 of CLPP1). 

[0233] A human genome search was carried out using blastn program with Expect 

setting at 0.01, Filter setting at default, Descriptions setting at 100, and Alignment settings 
at 100 The CLPP1 gene was mapped to or near locus 10pl5 of human chromosome 10. 
Specifically, CLPP1 gene is located between genes LOCI 19379 and LOG 159671, and 

overlaps with gene LOCI 59669. 

[0234] Human CLPP1 gene has 4 exons. The exons are mapped to nucleotides 
4200965 to 4203759 in human chromosome 10 of the Entrez Human Genome Sequence 
Database maintained by NCBI. The relative location of the exons in the genoxmc sequence 
is listed in Table 2. 

Example 3. Hydrophobicity analysis 

[0235] The hydrophobicity profile of CLPP1 sequence (FIG. 2) was generated usmg 
the GES (Goldman, Engelman and Steitz) hydrophobicity scale (Engelman et oL, Ann. Rev. 
Biophy, Biophys.Chem. 15:321-353, 1986.) Briefly, the GES scale is used to identxfy 
nonpolar transbilayer helices. The curve is the average of a residue-specific hydrophobe 
scale over a window of 20 residues. When the line is in the upper half of the frame 
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(positive), i. indicates a hydrophobic region and when i, is in the lower half (negative), a 

^^^^^^^^^ 

Y-axis represents the GES seore. The curve Hne shows the OES pattern of fire 
^ei whi, the stra,. Hne represents certain cutoff for potentia, membranespamnng 

7w Having described preferred embodiments of compositions, organisms and 
[0237] „„, rLP Pl (which are intended to be illustrative 

methodologies employing a novel human gene CLPP1 (wn>« 
and no, limiting), it is noted tha, modifications and variations can be 
„■„,„ in the art in light of the above teachings. Therefore, ts understood that changes 

described as defined by the appended claims. 
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